<86>Oct 2 00:42:30 userdel[2212245]: delete user 'rooter' <86>Oct 2 00:42:30 userdel[2212245]: removed group 'rooter' owned by 'rooter' <86>Oct 2 00:42:30 userdel[2212245]: removed shadow group 'rooter' owned by 'rooter' <86>Oct 2 00:42:30 groupadd[2212259]: group added to /etc/group: name=rooter, GID=1793 <86>Oct 2 00:42:30 groupadd[2212259]: group added to /etc/gshadow: name=rooter <86>Oct 2 00:42:30 groupadd[2212259]: new group: name=rooter, GID=1793 <86>Oct 2 00:42:30 useradd[2212271]: new user: name=rooter, UID=1793, GID=1793, home=/root, shell=/bin/bash, from=none <86>Oct 2 00:42:30 userdel[2212292]: delete user 'builder' <86>Oct 2 00:42:30 userdel[2212292]: removed group 'builder' owned by 'builder' <86>Oct 2 00:42:30 userdel[2212292]: removed shadow group 'builder' owned by 'builder' <86>Oct 2 00:42:30 groupadd[2212309]: group added to /etc/group: name=builder, GID=1794 <86>Oct 2 00:42:30 groupadd[2212309]: group added to /etc/gshadow: name=builder <86>Oct 2 00:42:30 groupadd[2212309]: new group: name=builder, GID=1794 <86>Oct 2 00:42:31 useradd[2212324]: new user: name=builder, UID=1794, GID=1794, home=/usr/src, shell=/bin/bash, from=none /usr/src/in/srpm/rccl-2.18.6-alt0.1.src.rpm: bad symbols in the license tag: // <13>Oct 2 00:42:34 rpmi: libidn2-2.3.7-alt1 sisyphus+339505.100.1.2 1706718968 installed <13>Oct 2 00:42:34 rpmi: libnettle8-3.9.1-alt1 sisyphus+322548.100.1.2 1686176879 installed <13>Oct 2 00:42:34 rpmi: libp11-kit-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Oct 2 00:42:34 rpmi: libtasn1-4.19.0-alt3 sisyphus+327816.100.1.1 1692802615 installed <13>Oct 2 00:42:34 rpmi: libhogweed6-3.9.1-alt1 sisyphus+322548.100.1.2 1686176879 installed <13>Oct 2 00:42:34 rpmi: libgnutls30-3.8.4-alt1 sisyphus+343729.100.2.1 1711571288 installed <13>Oct 2 00:42:34 rpmi: libngtcp2.16-1.7.0-alt1 sisyphus+356415.200.1.1 1725031912 installed <13>Oct 2 00:42:34 rpmi: libngtcp2_crypto_gnutls8-1.7.0-alt1 sisyphus+356415.200.1.1 1725031912 installed <13>Oct 2 00:42:34 rpmi: cmake-modules-3.29.3-alt1 sisyphus+348648.100.2.1 1716590540 installed <13>Oct 2 00:42:34 rpmi: libuv-1.48.0-alt2 sisyphus+357579.100.1.1 1726426171 installed <13>Oct 2 00:42:34 rpmi: librhash-1.3.5-alt3 sisyphus+286141.40.2.1 1632982456 installed <13>Oct 2 00:42:34 rpmi: libjsoncpp24-1.9.4-alt2 sisyphus+346331.200.2.1 1716448551 installed <13>Oct 2 00:42:34 rpmi: libexpat-2.5.0-alt1 sisyphus+346180.200.2.1 1716349835 installed <13>Oct 2 00:42:34 rpmi: publicsuffix-list-dafsa-20240911-alt1 sisyphus+357399.100.1.1 1726160479 installed <13>Oct 2 00:42:34 rpmi: libpsl-0.21.5-alt1 sisyphus+338474.100.1.1 1705684769 installed <13>Oct 2 00:42:34 rpmi: libnghttp3.9-1.5.0-alt1 sisyphus+356415.100.1.1 1725031855 installed <13>Oct 2 00:42:34 rpmi: libnghttp2-1.63.0-alt1 sisyphus+356414.100.1.1 1725031508 installed <13>Oct 2 00:42:34 rpmi: openldap-common-2.6.8-alt1 sisyphus+351621.100.1.1 1719420449 installed <13>Oct 2 00:42:34 rpmi: libntlm-1.5-alt1 sisyphus+278100.3300.1.1 1626058899 installed <13>Oct 2 00:42:34 rpmi: libidn-1.37-alt2 sisyphus+300849.100.1.1 1653769687 installed <13>Oct 2 00:42:34 rpmi: libverto-0.3.2-alt1_1 sisyphus+321176.2200.10.2 1684803947 installed <13>Oct 2 00:42:34 rpmi: liblmdb-0.9.32-alt1 sisyphus+342426.100.1.1 1710124288 installed <13>Oct 2 00:42:34 rpmi: libkeyutils-1.6.3-alt1 sisyphus+346336.200.2.2 1716472658 installed <13>Oct 2 00:42:34 rpmi: libcom_err-1.46.4.0.5.4cda-alt1 sisyphus+283826.100.1.1 1629975345 installed <13>Oct 2 00:42:34 rpmi: libbrotlicommon-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Oct 2 00:42:34 rpmi: libbrotlidec-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Oct 2 00:42:34 rpmi: rpm-macros-cmake-3.29.1-alt1 sisyphus+344518.300.3.1 1712379787 installed <13>Oct 2 00:42:34 rpmi: rpm-macros-alternatives-0.5.2-alt2 sisyphus+315270.200.2.1 1676457367 installed <13>Oct 2 00:42:34 rpmi: alternatives-0.5.2-alt2 sisyphus+315270.200.2.1 1676457367 installed <13>Oct 2 00:42:34 rpmi: ca-certificates-2024.07.01-alt1 sisyphus+351897.100.1.1 1719826350 installed <13>Oct 2 00:42:34 rpmi: ca-trust-0.2.0-alt1 sisyphus+344843.100.1.1 1712743326 installed <13>Oct 2 00:42:34 rpmi: p11-kit-trust-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Oct 2 00:42:34 rpmi: libcrypto3-3.1.7-alt1 sisyphus+356755.100.1.1 1725388416 installed <13>Oct 2 00:42:35 rpmi: libssl3-3.1.7-alt1 sisyphus+356755.100.1.1 1725388416 installed <86>Oct 2 00:42:35 groupadd[2214443]: group added to /etc/group: name=_keytab, GID=999 <86>Oct 2 00:42:35 groupadd[2214443]: group added to /etc/gshadow: name=_keytab <86>Oct 2 00:42:35 groupadd[2214443]: new group: name=_keytab, GID=999 <13>Oct 2 00:42:35 rpmi: libkrb5-1.21.3-alt2 sisyphus+351857.100.1.1 1719735141 installed <13>Oct 2 00:42:35 rpmi: libgsasl-2.2.0-alt1 sisyphus+333173.100.1.1 1698696954 installed <86>Oct 2 00:42:35 groupadd[2214450]: group added to /etc/group: name=sasl, GID=998 <86>Oct 2 00:42:35 groupadd[2214450]: group added to /etc/gshadow: name=sasl <86>Oct 2 00:42:35 groupadd[2214450]: new group: name=sasl, GID=998 <13>Oct 2 00:42:35 rpmi: libsasl2-3-2.1.28-alt2 sisyphus+343335.100.1.1 1711112544 installed <13>Oct 2 00:42:35 rpmi: libldap2-2.6.8-alt1 sisyphus+351621.100.1.1 1719420449 installed <13>Oct 2 00:42:35 rpmi: libarchive13-3.7.5-alt2 sisyphus+358189.100.1.1 1727162763 installed <13>Oct 2 00:42:35 rpmi: libssh2-1.11.0-alt2 sisyphus+339356.100.1.1 1706593137 installed <13>Oct 2 00:42:35 rpmi: libcurl-8.10.0-alt1 sisyphus+357271.100.1.1 1726044759 installed <13>Oct 2 00:42:35 rpmi: cmake-3.29.3-alt1 sisyphus+348648.100.2.1 1716590540 installed <13>Oct 2 00:42:43 rpmi: llvm-common-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:42:43 rpmi: llvm-rocm-filesystem-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:42:43 rpmi: libnuma-2.0.18-alt1 sisyphus+358102.100.1.1 1727069613 installed <13>Oct 2 00:42:44 rpmi: rocm-device-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:42:44 rpmi: llvm18.1-filesystem-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:44 rpmi: clang18.1-support-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:44 rpmi: llvm18.1-polly-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:44 rpmi: gcc-c++-common-1.4.28-alt1 sisyphus+348678.100.1.1 1716396142 installed <13>Oct 2 00:42:44 rpmi: libstdc++13-devel-13.2.1-alt4 sisyphus+354645.100.1.1 1723060849 installed <13>Oct 2 00:42:44 rpmi: librocm-smi1-6.1.2-alt0.2 sisyphus+352428.100.1.1 1720459745 installed <13>Oct 2 00:42:44 rpmi: libpciaccess-1:0.18.1-alt1 sisyphus+343583.300.1.1 1711440789 installed <13>Oct 2 00:42:44 rpmi: libdrm-1:2.4.123-alt1 sisyphus+357330.40.3.1 1726125397 installed <13>Oct 2 00:42:44 rpmi: libhsakmt1-6.1.2-alt0.1 sisyphus+352247.600.5.1 1720254766 installed <13>Oct 2 00:42:44 rpmi: libhsa-runtime1-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Oct 2 00:42:44 rpmi: libpci-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Oct 2 00:42:44 rpmi: pciids-20240913-alt1 sisyphus+357455.100.1.1 1726250568 installed <13>Oct 2 00:42:44 rpmi: pciutils-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Oct 2 00:42:44 rpmi: libmpdec3-2.5.1-alt3 sisyphus+314490.500.5.1 1675432004 installed <13>Oct 2 00:42:44 rpmi: libgdbm-1.8.3-alt10 sisyphus+346222.200.3.2 1716468404 installed <13>Oct 2 00:42:44 rpmi: libb2-0.98.1-alt1_1 sisyphus+291614.100.1.1 1638962877 installed <13>Oct 2 00:42:44 rpmi: python3-3.12.6-alt1 sisyphus+357228.100.1.1 1725970095 installed <13>Oct 2 00:42:45 rpmi: python3-base-3.12.6-alt1 sisyphus+357228.100.1.1 1725970095 installed <13>Oct 2 00:42:45 rpmi: clang-rocm-libs-support-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:42:48 rpmi: clang-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:42:48 rpmi: rocminfo-6.1.2-alt0.1 sisyphus+352247.1700.9.1 1720269882 installed <13>Oct 2 00:42:48 rpmi: libedit3-3.1.20230828-alt1 sisyphus+330914.200.3.1 1696922743 installed <13>Oct 2 00:42:48 rpmi: llvm18.1-gold-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:50 rpmi: llvm18.1-libs-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:50 rpmi: libclang-cpp18-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:50 rpmi: clang18.1-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:50 rpmi: clang-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:42:52 rpmi: clang-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:42:53 rpmi: llvm18.1-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:42:53 rpmi: llvm-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:43:06 rpmi: llvm-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:07 rpmi: libclang18-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:43:07 rpmi: clang18.1-devel-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:43:07 rpmi: clang-devel-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:43:08 rpmi: clang18.1-tools-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:43:08 rpmi: clang-tools-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:43:15 rpmi: clang-rocm-tools-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:15 rpmi: lld18.1-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:43:15 rpmi: lld-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:43:16 rpmi: lld-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:17 rpmi: libamd_comgr2-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:18 rpmi: llvm-rocm-gold-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:19 rpmi: llvm-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:19 rpmi: hip-runtime-amd-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Oct 2 00:43:19 rpmi: hipcc-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:22 rpmi: mlir18.1-tools-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:43:40 rpmi: llvm18.1-devel-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Oct 2 00:43:40 rpmi: llvm-devel-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Oct 2 00:43:53 rpmi: llvm-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:43:53 rpmi: hip-devel-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Oct 2 00:43:53 rpmi: rocm-comgr-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:44:03 rpmi: clang-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 2 00:44:04 rpmi: hipify-clang-6.1.2-alt0.1 sisyphus+352428.200.1.1 1720459887 installed <13>Oct 2 00:44:04 rpmi: hsa-rocr-devel-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Oct 2 00:44:04 rpmi: librocm-smi-devel-6.1.2-alt0.2 sisyphus+352428.100.1.1 1720459745 installed <13>Oct 2 00:44:04 rpmi: libstdc++-devel-13-alt1 sisyphus+323337.300.1.1 1687267966 installed <13>Oct 2 00:44:04 rpmi: rocm-cmake-6.1.2-alt0.1 sisyphus+352247.100.1.1 1720180839 installed Building target platforms: x86_64 Building for target x86_64 Wrote: /usr/src/in/nosrpm/rccl-2.18.6-alt0.1.nosrc.rpm (w1.gzdio) Installing rccl-2.18.6-alt0.1.src.rpm Building target platforms: x86_64 Building for target x86_64 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.97734 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + rm -rf rccl-2.18.6 + echo 'Source #0 (rccl-2.18.6.tar):' Source #0 (rccl-2.18.6.tar): + /bin/tar -xf /usr/src/RPM/SOURCES/rccl-2.18.6.tar + cd rccl-2.18.6 + /bin/chmod -c -Rf u+rwX,go-w . + subst 's,cat ${ROCM_PATH}/.info/version,echo 6.1.2,' CMakeLists.txt + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.97734 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + export ALTWRAP_LLVM_VERSION=rocm + ALTWRAP_LLVM_VERSION=rocm + mkdir -p x86_64-alt-linux + cmake -DCMAKE_SKIP_INSTALL_RPATH:BOOL=yes '-DCMAKE_C_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_CXX_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_Fortran_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' -DCMAKE_INSTALL_PREFIX=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_DESTINATION=lib64 -DLIB_SUFFIX=64 -S . -B x86_64-alt-linux -Wno-dev -DROCM_PATH=/usr -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_INSTALL_LIBDIR=lib64 -DENABLE_MSCCL_KERNEL=ON -- The CXX compiler identification is Clang 17.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/clang++ - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- Checking for ROCm support for GPU targets: -- Performing Test COMPILER_HAS_TARGET_ID_gfx803 -- Performing Test COMPILER_HAS_TARGET_ID_gfx803 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx908_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx908_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx940 -- Performing Test COMPILER_HAS_TARGET_ID_gfx940 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx941 -- Performing Test COMPILER_HAS_TARGET_ID_gfx941 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Compiling for gfx803;gfx900:xnack-;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack-;gfx90a:xnack+;gfx940;gfx941;gfx942;gfx1030;gfx1100;gfx1101;gfx1102 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- ROCM_PATH found: /usr -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc -- hipcc version: 6.1.40093 -- ROCm version: 6.1.2 ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:79 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:80 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipEventDisableSystemFence -- Looking for hipEventDisableSystemFence - not found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:84 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:79 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:80 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - not found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:84 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.h -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp -- HIP_UNCACHED_MEMORY enabled -- RCCL LL128 protocol enabled -- Building shared RCCL library -- rocm-cmake: Set license file to /usr/src/RPM/BUILD/rccl-2.18.6/LICENSE.txt. -- Configuring done (15.8s) -- Generating done (0.0s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_C_COMPILER CMAKE_C_FLAGS CMAKE_Fortran_FLAGS LIB_DESTINATION LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux + cmake --build x86_64-alt-linux --verbose --parallel 16 Change Dir: '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j16 gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -S/usr/src/RPM/BUILD/rccl-2.18.6 -B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux --check-build-system CMakeFiles/Makefile.cmake 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux//CMakeFiles/progress.marks gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/Makefile2 all /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /usr/src/RPM/BUILD/rccl-2.18.6/cmake/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Built target git_version_check gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_to_allv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_allv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/alltoall_pivot.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/alltoall_pivot.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/channel.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/channel.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/device/broadcast.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/broadcast.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/all_to_all.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_all.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/device/all_gather.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_gather.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/onerank_reduce.cu -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/onerank_reduce.cu -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/transport/shm.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/shm.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/bootstrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/bootstrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/device/all_reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/common.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/msccl_kernel_impl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/msccl_kernel_impl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/common_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/broadcast.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/broadcast.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/primitives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/primitives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/op128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/op128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/reduce_scatter.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_scatter.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/sendrecv.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/sendrecv.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/prims_ll128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/msccl.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/msccl.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_simple.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_simple.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce_scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce_scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/reduce_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/collectives/sendrecv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/sendrecv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/debug.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/debug.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rings.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rings.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/graph/connect.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/connect.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/paths.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/paths.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/rome_models.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/enqueue.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/enqueue.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/xml.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/BfdBacktrace.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/BfdBacktrace.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/trees.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/trees.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rome_models.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/tuning.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/tuning.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/search.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/search.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/xml.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/group.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/group.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/align.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/align.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/alloc.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/alloc.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/archinfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/archinfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/argcheck.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/argcheck.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/bootstrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/bootstrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/channel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/channel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/checks.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/checks.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/coll_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/coll_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/cpuset.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/cpuset.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/core.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/core.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/comm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/comm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/debug.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/debug.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/enqueue.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/enqueue.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/collectives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/collectives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/devcomm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/devcomm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/git_version.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/git_version.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/gdrwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/gdrwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/graph.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/graph.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/group.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/group.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvsymbols.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvsymbols.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/info.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/info.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ipcsocket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ipcsocket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvcore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvcore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_lifecycle.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_lifecycle.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_scheduler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_scheduler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_parser.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_parser.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_status.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_status.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_setup.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_setup.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/msccl/msccl_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/nccl_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nccl_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_event.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_event.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvmlwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvmlwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCuda.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCudaRt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtOpenCL.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtSync.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtPayload.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtx3.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx_stub.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx_stub.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/p2p.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/p2p.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/param.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/param.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/profiler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/profiler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/proxy.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/proxy.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_bfloat16.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_bfloat16.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rocm_smi_wrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocm_smi_wrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_vars.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_vars.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/rocmwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocmwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/shm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/shm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/signals.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/signals.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/socket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/socket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/strongstream.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/strongstream.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/timer.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/timer.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/trees.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/trees.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/transport.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/transport.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/include/utils.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/utils.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/archinfo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/archinfo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/argcheck.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/argcheck.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ipcsocket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ipcsocket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_lifecycle.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvsymbols.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvsymbols.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_status.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_status.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/init.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/init.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_parser.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_parser.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/npkit.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/npkit.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_setup.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_setup.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/nvmlwrap_stub.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/nvmlwrap_stub.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/param.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/param.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocm_smi_wrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocm_smi_wrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/profiler.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/profiler.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/signals.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/signals.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocmwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocmwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/shmutils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/shmutils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/utils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/utils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/strongstream.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/strongstream.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/nvls.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/nvls.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_ib.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_ib.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport/coll_net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/coll_net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/proxy.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/proxy.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 41%] Hipifying src/transport/p2p.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/p2p.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ 3 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ 3 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 3 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 3 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 4 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx941. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx940. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx1101. 5 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const In file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cch:a6r: *In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.ht:a10g: NIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.hm:e11,: In file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.ht:r12u: cIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h :m124s: cIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.hl:X14m: lIn file included from N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.ho:d60e: *In file included from */usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h :n14o: d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.he:)40 :{13 : | warning: ^~~~~~~~~~~~~~~unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for host. 4 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* aIn file included from ttrNa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.ccm:e8,: In file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.ho:n60s: tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.hi:n14t: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.ha:t40t:r13V:a lwarning: uunused function 'log2i' [-Wunused-function]e ) { | ^~~~~~~~~~~~~~ 40 | static lon/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hg: 240l:o21g:2 iwarning: (unused function 'xmlAddNode' [-Wunused-function]l ong n) { | 240 ^~~~~ | static ncclResult_tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.ccx:m10l: A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hd:d217N:o21d:e (warning: sunused function 'ncclTopoDevToRank' [-Wunused-function]t ruct n c217c | lsXtmalt*i cx mnlc,c lsRtersuucltt _ntc cnlcXcmllTNoopdoeD*e vpTaorReanntk,( sctornusctt cnhcacrl*T ospuobSNyasmtee,m *s tsryusctte mn,c cilnXtm ldNeovd,e *i*n ts*u br)a n{k ) | { ^~~~~~~~~~ | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h :256:21: In file included from warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.ccunused function 'xmlRemoveNode' [-Wunused-function]: 13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41 :25621 | :s twarning: aunused function 'ncclChannelCompute' [-Wunused-function]t ic nccl R41e | ssutlatt_itc xnmclcRleRmeosvuelNto_dte (nsctcrluCchta nnncecllCXommlpNuotdee(*s tnroudcet) n{c c l| C ^~~~~~~~~~~~~o mm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h*: 276c:o21m:m ,warning: unused function 'kvConvertToInt' [-Wunused-function]i nt pee r276, | sitnatt icch anncncellRIenscu,l ti_ntt kcvoCloln,v eirnttT*ocIhnatn(nceolnIsdt) c{h a r| * ^~~~~~~~~~~~~~~~~~ str,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cci:n14t: */usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h :v94a:l21u:e ,warning: unused function 'xmlGetAttrInt' [-Wunused-function]s truct k94v | Dsitcatt*i cd incctc)l R{e s u| l ^~~~~~~~~~~~~~t _t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h :x289m:l21G:e twarning: Aunused function 'kvConvertToStr' [-Wunused-function]t trInt (289s | tsrtuactti cn cncclcXlmRleNsoudlet*_ tn okdveC,o ncvoenrsttT ocShtarr(*i natt tvraNlaumee,, cionnts*t vcahlaure*)* {s t r| , ^~~~~~~~~~~~~ st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hr:u101c:t21 :k vwarning: Dunused function 'xmlGetAttrIntDefault' [-Wunused-function]i ct* dic t101) | s{t a t| i ^~~~~~~~~~~~~~c ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncc:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ lXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ 20 warnings generated when compiling for gfx940. 20 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 2020 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ 20 warnings generated when compiling for gfx1101. 20 warnings generated when compiling for gfx906. 20 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx941. 20 warnings generated when compiling for gfx803. 20 warnings generated when compiling for gfx90a. 20 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx942. 20 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | 9 double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ 9 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.ccd:i773c:t7): {warning: unused variable 'nChannels' [-Wunused-variable] | ^~~~~~~~~~~~~~ 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ 28 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ 28 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx1102. 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx941. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx942. 28 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) {In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx940. 2 warnings generated when compiling for gfx900. 2 warnings generated when compiling for gfx941. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, conIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ st char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx803. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx940. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx900. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx942. 10 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | sIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ tatic ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 8 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx942. 8 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h::826: :In file included from 21/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:: 60warning: : unused function 'collNetIallreduce' [-Wunused-function]In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]26 | stat i40c | sntcactliRce sluolntg_ tl ocgo2lil(NleotnIga lnl)r e{d u c| e ^~~~~( struct ncIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccl:C10o: m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hm:*217 :c21o:m mwarning: ,unused function 'ncclTopoDevToRank' [-Wunused-function] void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(sIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ truct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, cIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetNIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccm:e8(: sIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.hr:u60c: tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.hn:c14c: l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.hC:o40m:m13*: cwarning: ounused function 'log2i' [-Wunused-function]m m) { re t40u | rsnt actoimcm -l>onncgc llCoogl2liN(elto-n>gn anm)e ;{ } | ^~~~~| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21:In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccwarning: :unused function 'collNetListen' [-Wunused-function]10 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: 19warning: | unused function 'ncclTopoDevToRank' [-Wunused-function]s tatic n c217c | lsRteastuilct _ntc ccloRlelsNuelttL_its tnecnc(lsTtorpuocDte vnTcocRlaCnokm(ms*t rcuocmtm ,n cicnltT odpeovS,y svtoeimd** shyasntdelme,, ivnoti dd*e*v ,l iisntte*n Croamnmk)) {{ N C| C ^~~~~~~~~~~~~~~~~L CHECK(In file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cco:m14m: -/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h>:n16c:c20l:C owarning: lunused function 'collNetName' [-Wunused-function]l Net->li s16t | esnt(adteivc, choannsdtl ec,h alri*s tceonlCloNmemt)N)a;m er(estturrunc tn cncclcSluCcocmems*s ;c o}m m )| ^~~~~~~~~~~~~{ r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.he:t20u:r21n: cwarning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, vIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cci:d8*: In file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.he:n60d: DIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.ht:a14,: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.hv:o40i:d13*: rwarning: eunused function 'log2i' [-Wunused-function]c vData, 40i | nstt actoiucn tl,o nngc clloDga2tia(Tlyong n) { p| e ^~~~~_ t dataTyIn file included from p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cce:,10 : n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hc:c217l:R21e:d Owarning: punused function 'ncclTopoDevToRank' [-Wunused-function]_ t redO p217, | svtoaitdi*c snecncdlMRheasnudllte_,t vnocicdl*T orpeocDveMvhTaonRdalnek,( s tvrouicdt* *n crcelqTuoepsotS)y s{t e m| * ^~~~~~~~~~~~~~~~~ syst/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.he:m28,: 21i:n twarning: unused function 'collNetIflush' [-Wunused-function]d ev, int* r a28n | ks)t a{t i c| ^~~~~~~~~~~~~~~~~n cclResIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccl:t14_: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h :c16o:l20l:N ewarning: tunused function 'collNetName' [-Wunused-function]I flush (16s | tsrtuactti cn cccolnCsotm mc*h acro*m mc,o lvloNiedt*N acmoel(lsCtormumc,t vnocicdl*C odmamt*a ,c oimnmt) s{i zree,t uvroni dc*o mmmh-a>nndclcel,C ovloliNde*t*- >rneaqmuee;s t}) {| ^~~~~~~~~~~N CCLC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hH:E19C:K21(:c owarning: munused function 'collNetListen' [-Wunused-function]m ->ncclCo l19l | Nsetta-t>iicf lnucschl(RceoslullCto_mtm ,c odlaltNae,t Lsiisztee,n (mshtarnudclte ,n crcelqCuoemsmt*) )c;o mrme,t uirnnt ndcecvl,S uvcocieds*s ;h a}n d l| e ^~~~~~~~~~~~~, voi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hd:*29*: 21l:i swarning: tunused function 'collNetTest' [-Wunused-function]e nComm) {29 | NsCtCaLtCiHcE CnKc(ccloRmems-u>lntc_ctl CcoollllNNeett-T>elsits(tsetnr(udcetv ,n chcalnCdolmem,* lciosmtme,n Cvoomimd)*) ;r erqeuteusrtn, nicnctl*S udcocnees,s ;i n}t * | s ^~~~~~~~~~~~~i ze) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h{: 20N:C21C:L Cwarning: Hunused function 'collNetConnect' [-Wunused-function]E CK(comm->n c20c | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* dIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResulonst char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ omm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->nclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ata, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlt_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ clCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ UnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrVaIn file included from lue/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc): 8{: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h| : ^~~~~~~~~~~~60 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h14:: 144/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h::2140:: 13warning: :unused function 'xmlSetAttr' [-Wunused-function] warning: unused function 'log2i' [-Wunused-function] 144 | static n40c | csltRaetsiucl tl_otn gx mlloSge2tiA(tltorn(gs tnr)u c{t n| c ^~~~~c lXmlNode* node, In file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cco:n10s: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h :c217h:a21r:* warning: aunused function 'ncclTopoDevToRank' [-Wunused-function]t trName ,217 | csotnastti cc hnacrc*l Rveasluulet)_ t{ n c| c ^~~~~~~~~~l TopoD/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.he:v157T:o21R:a nwarning: kunused function 'xmlSetAttrIfUnset' [-Wunused-function]( struct n157c | csltTaotpiocS ynsctcelmR*e ssuylstt_etm ,x milnSte tdAetvt,r IifnUtn*s erta(nskt)r u{c t | n ^~~~~~~~~~~~~~~~~c clXmlNIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccd:e14*: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hn:o16d:e20,: cwarning: ounused function 'collNetName' [-Wunused-function]n st char *16 | astttartNiacm ec,o ncsotn scth acrh*a rc*o lvlaNleuteN)a m{e ( s| t ^~~~~~~~~~~~~~~~~r uct n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hc:c182l:C21o:m mwarning: *unused function 'xmlSetAttrFloat' [-Wunused-function] comm) {182 | rsettautrinc cnocmcml-R>enscuclltC_otl lxNmeltS-e>tnAatmter;F l}o a t| ( ^~~~~~~~~~~s truc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ht: 19n:c21c:l Xwarning: munused function 'collNetListen' [-Wunused-function]l Node* nod e19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ , const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx908. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx900. 23 warnings generated when compiling for gfx941. 23 warnings generated when compiling for gfx1100. 23 warnings generated when compiling for gfx940. 23 warnings generated when compiling for gfx1030. 23 warnings generated when compiling for gfx1102. 23 warnings generated when compiling for gfx906. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx803. 23 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx942. 23 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ 3 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc::12812:: 21/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:: 168warning: :unused function 'xmlFindTagKv' [-Wunused-function]14 : warning: unused variable 'info' [-Wunused-variable] 128 | stati c168 | n c cgldRre_siunlfto__tt ximnlfFoi;n d T| a ^~~~g Kv(st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.hr:u170c:t12 :n cwarning: cunused variable 'mh' [-Wunused-variable]l Xml *170 | x m lg,d rc_omnhs_tt cmhha;r * | t ^~a gNa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.hm:e172,: 9s:t rwarning: uunused variable 'gdrMap' [-Wunused-variable]c t n c172c | l X mvloNiodd e**g*d rnMoadpe;, c| o ^~~~~~n st char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h(:s201t:r19u:c twarning: unused variable 'md' [-Wunused-variable]n cclXm l201N | o d eg*d rn_omdeem,_ dceosncs_tt c*hmadr *= a(tgtdrrN_ammeem,_ dceosncs_tt *c)hgadrr*H avnadlluee;) {| ^~ | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc ^~~~~~~~~~~~: 1995:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h26::207 :warning: 21unused variable 'payload' [-Wunused-variable]: warning: unused function 'xmlGetSub' [-Wunused-function] 1995207 | | s tNavttixcP anrcacmlsRCeosmumlItn_itt RxamnlkG eptaSyulbo(asdt{rmuycrta nnkc,c lnXrmalnNkosd,e *c undoadDee,v }c;o n s| t ^~~~~~~ char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, cons/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cct: 1675c:h11a:r *warning: *unused variable 'stackSize' [-Wunused-variable] str, s t1675r | u c ti nktv6D4i_ctt *s tdaiccktS)i z{e ; | ^~~~~~~~~~~~~~| ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.ccwarning: :unused variable 'devProp' [-Wunused-variable]782 :21: 1676warning: | unused function 'collNetTrySetup' [-Wunused-function] hipDev i782c | esPtraotpi_ct ndcecvlPRreospu;l t _| t ^~~~~~~ collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc| ^~~~~~~~~~~~~~~~~: 1675:11:In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.ccwarning: :unused variable 'stackSize' [-Wunused-variable]38 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: 1675warning: | unused function 'xmlGetAttrInt' [-Wunused-function] int64 _94t | ssttaatcikcS inzcec;l R e| s ^~~~~~~~~u lt_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc :x1676m:l19G:e twarning: Aunused variable 'devProp' [-Wunused-variable]t trIn t1676( | s t rhuicptD envciccleXPmrloNpo_dte *d envoPdreo,p ;c o n| s ^~~~~~~t char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult45_ warningts generatedc when compiling for ogfx906l. lNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constex/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ pr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 45 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.ccC:L1675C:H11E:C Kwarning: (unused variable 'stackSize' [-Wunused-variable]c omm- >1675n | c c liCnotl6l4N_ett -s>tcalcoksSeiCzoel;l ( c| o ^~~~~~~~~l lCo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.ccm:m1676):)19;: rwarning: eunused variable 'devProp' [-Wunused-variable]t urn 1676n | c c lhSiupcDceevsisc;e P}r o p| _ ^~~~~~~~~~~~~~~~t devPr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ho:p31;: 21 :| ^~~~~~~warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhand45l warninges, generated when compiling for vgfx803o. id** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ Name, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 45 warnings generated when compiling for gfx90a. 45 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx900. 45 warnings generated when compiling for gfx90a. 45 warnings generated when compiling for gfx1100. 45 warnings generated when compiling for gfx941. 45 warnings generated when compiling for gfx908. 45 warnings generated when compiling for gfx1030. 45 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 45 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hunused variable 'info' [-Wunused-variable]: 271:14: note: initialize the variable 'dst' to silence this warning 168271 | | g d rv_oiindf o*_dts ti,n f*os;r c ;| ^~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h :| 170 = nullptr: 12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ * props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc*: 330n:d34e:v )warning: suggest braces around initialization of subobject [-Wmissing-braces]{ NCCLC H330E | C K (sctormumc-t> nnccccllCWoolrlkNEelte-m>Pd2epv iecleesm( n=d e{v0)});; r| e ^t u r| n {} ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc26: | 10s: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ha:t16i:c20 :n cwarning: cunused function 'collNetName' [-Wunused-function]l Result_t c16o | lsltNaettiIca lclornesdtu cceh(asrt*r uccotl lnNcectlNCaommem(*s tcroumcmt, nvcocildC*o mcmo*l lcCoommmm), {v oriedt*u rsne ncdoDmamt-a>,n cvcoliCdo*l lrNeectv-D>antaam,e ;i n}t c| o ^~~~~~~~~~~u nt, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hn:c17c:l21D:a twarning: aunused function 'collNetDevices' [-Wunused-function]T ype_t d17a | tsatTaytpiec, nnccccllRReesduOlpt__tt rceodlOlpN,e tvDoeivdi*c esse(nsdtMrhuacntd lnec,c lvCooimdm** rceocmvmM,h ainndtl*e ,n d evvo)i d{* *N CrCeLqCuHeEsCtK)( c{omim->nndcecxl)C o{l l N| e ^~~~~~~~~~~~~~~~~~~t ->devices(nd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.he:v217):)21;: rwarning: eunused function 'ncclTopoDevToRank' [-Wunused-function]t urn nc c217l | Ssutcacteiscs ;n c}c l R| e ^~~~~~~~~~~~~~s ult_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h :n18c:c21l:T owarning: punused function 'collNetGetProperties' [-Wunused-function]o DevToRank( s18t | rsutcatt incc cnlcTcolpRoeSsyusltte_mt* csoylsltNeemt,G eitnPtr odpeevr,t iienst(*s trraunckt) n{c c l| C ^~~~~~~~~~~~~~~~~o mm* co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hm:m229,: 14i:n twarning: unused function 'ncclTopoXGMISpeed' [-Wunused-function]d ev, n229c | csltNaettiPcr ofpleoratti ensc_ctl*T opprooXpGsM)I S{p eNeCdC(LcCoHnEsCtK (cchoamrm*- >gnccnc)l C{o l l| N ^~~~~~~~~~~~~~~~~e t->getProperIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cci:e14s: (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.hd:e161v:,14 :p rwarning: ounused function 'ncclGdrInit' [-Wunused-function]p s)); 161r | esttuartni cn cgcdlrS_utc cnecscsl;G d}r I n| i ^~~~~~~~~~~~~~~~~~~~t () {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h : 19| : ^~~~~~~~~~~21 : warning: unused function 'collNetListen' [-Wunused-function]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h :206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 19 | s t206a | tsitca tniccc lnRcecsluRlets_utl tc_otl lnNcectlLGidsrtCeund(asFtrreuec(tv onicdc*l CgodmrmH*a ncdolmem), {i n t| ^~~~~~~~~~~~~~~d ev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr omm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t cIn file included from oll/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.ccN:e10t: Ia/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hl:l16r:e20d:u cwarning: eunused function 'collNetName' [-Wunused-function]( struct ncclCo m16m | *s tcaotmimc, cvoonisdt* cchoalrl*C ocmoml,l NveotiNda*m es(esntdrDuactta ,n cvcoliCdo*m mr*e ccvoDmamt)a ,{ irnett ucronu ncto,m mn-c>cnlcDcaltCaoTlylpNee_tt- >dnaatmaeT;y p}e , | n ^~~~~~~~~~~c clRedOp_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ht: 17r:e21d:O pwarning: ,unused function 'collNetDevices' [-Wunused-function] void* s e17n | dsMthaatnidcl en,c cvloRieds*u lrte_ctv MchoalnldNleet,D e vviocieds*(*s trreuqcute sntc)c l{C o m| m ^~~~~~~~~~~~~~~~~* comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h,: 28i:n21t:* warning: nunused function 'collNetIflush' [-Wunused-function]d ev) { NCCLC H28E | CsKt(actoimcm -n>cncclcRleCsoullltN_ett -c>odlelvNiecteIsf(lnudsehv()s)t;r urcett unrcnc lnCcocmlmS*u cccoemsms,; v}o i d| * ^~~~~~~~~~~~~~ coll/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hC:o18m:m21,: vwarning: ounused function 'collNetGetProperties' [-Wunused-function]i d* data, i18n | ts tsaitziec, nvcocildR*e smuhlatn_dtl ec,o lvloNiedt*G*e trPerqoupeesrtt)i e{s (NsCtCrLuCcHtE CnKc(ccloCmomm-m>*n cccolmCmo,l liNnett -d>eivf,l unschc(lcNoeltlPCroompme,r tdiaetsa_,t *s ipzreo,p sm)h a{n dNlCeC,L CrHeEqCuKe(scto)m)m;- >rnectculrCno lnlcNceltS-u>cgceetsPsr;o p}e r t| i ^~~~~~~~~~~~~e s(dev/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h,: 29p:r21o:p swarning: )unused function 'collNetTest' [-Wunused-function]) ; return 29n | csctlaStuiccc enscsc;l R}e s u| l ^~~~~~~~~~~~~~~~~~~~t _t c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ho:l19l:N21e:t Twarning: eunused function 'collNetListen' [-Wunused-function]s t(struct 19 | sncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ tatic ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h14::217 :note: 21initialize the variable 'dst' to silence this warning: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 271 | v217o | isdt a*tdisct ,n c*cslrRce;s u l| t ^_ t | n = nullptrc clTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx941. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx1102. 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx900. 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for host. 28 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18c: t /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hm:s195c:c21l:X mwarning: lunused function 'ncclTopoIdToIndex' [-Wunused-function]N ode** node) { 195| | ^~~~~~~~~~~~~~~s tatic ncclResult_t ncclT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cco:p27o:I20d:T owarning: Iunused variable 'mscclAlgoFilePathEnv' [-Wunused-variable]n dex(struc t27 | nsctcaltTiocp ocSoynsstte mc*h asry*s tmesmc,c liAnltg otFyiplee,P aitnhtE6n4v_ t= i"dM,S CiCnLt_*A LiGnOd_eFxI)L E{_ P A| T ^~~~~~~~~~~~~~~~~H "; | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx1101. 5 warnings generated when compiling for gfx1102. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx941. 5 warnings generated when compiling for gfx1100. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx940. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx900. 5 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for host. 5 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* suIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ bName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx942. 17 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx900. 21 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx940. 21 warnings generated when compiling for gfx906. 21 warnings generated when compiling for gfx941. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ *)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx803. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx941. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx940. 11 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested hereIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :45110 | : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 169 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h : 509p:r29i:m swarning: (field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]t id, nthread s507, | t r e et-i>dd(otwind,) ,t rneteh-r>edaodwsn(, anrtghsr-e>asdesn)d,b uwfifd,( tairdg%sW-A>RrPe_cSvIbZuEf)f,, argwsa-r>pr(etdiOdp/AWrAgR)P;_ S I| Z ^E ), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ~~~~~~~~~~~~~~~~~~: 994 :| 5 stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t): note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here508 | 994w | a r p I nrBulnoTcrke(etShprleiatd ( a| r warp(tid/WARP_SIZEg s); 509| | ^ fla/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:T202h:r53e:a dnote: (in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here( tid%4 )202= | = 3 ) , g r o uRpu(ngWroorukpE)l,e m e| n ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~t < F| n warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3, T, R e510d | O p , Asltgeop,S iPzreo(tnoc>c(l)S.hrmuenm(.wceo)m;m . b| u ^f fSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpps:[5N:C1C:L _note: Pin instantiation of member function 'RunWork, 0, 1>::run' requested hereR OTO_ L5L | 1I2M8P]L/_NCCOCLLL__SFTUENPCS(/AslilzReeodfu(cuei,n tT6R4E_Et,) )L L{1 2 8| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ S u| m group(groupP ostDiv, in/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:8533_:t9): note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :533391 | : 95 : note: expanded from macro 'IMPL_COLL_FUNC' prims (391t | i d -RnutnhWroerakdyuppe,> ,t rNeCeC-L>_dAoLwGnO,_ #a#raglsg-o>,s eNnCdCbLu_fPfR,O TaOr_g#s#-p>rroetcov>b(u)f.fr,u n (| & ^n cclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:m994e:m5.:w onote: rin instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested herek ); \994 | | ^ runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement509(:)29.:r uwarning: nfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]( we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp :5079 | : 1 : note: tin instantiation of member function 'RunWork, 0, 1>::run' requested herei d(t i9d | )I,M PnLt_hCrOeLaLd_sF(UnNtCh(rAelaldRse)d,uce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cppz:e1o: fIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:i10n: tIn file included from 6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h4:_169t: )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h): 509{: 29 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 533507: | 9 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested heretid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 25 warnings generated when compiling for gfx90a. 25 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ g); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSi7 warnings generated when compiling for gfx1100. ze(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ e, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid;In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ L_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threa7 warnings generated when compiling for gfx1102. dIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ , flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h386: | 514 : 9 : iwarning: nvariable 'offset' set but not used [-Wunused-but-set-variable]t wire O514f | f s e t i=n tW iorfefWsoertd P=e rtSildi;c e *| w ^a rp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 7 warnings generated when compiling for gfx906. warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NC7 warnings generated when compiling for gfx941. CL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.bufIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ fSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :153514 | : 9 : warning: uvariable 'offset' set but not used [-Wunused-but-set-variable]i nt32 _514t | d a t ai1n,t folfafgs1e,t d=a ttai2d,; f l| a ^g 2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int327 warnings generated when compiling for gfx1101. _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp::11: : In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::1010: : In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h::168169: : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h::153271::1419:: warning: warning: unused variable 'data1' [-Wunused-variable]unused variable 'ptr' [-Wunused-variable] 271 | 153 | u i nuti3n2t_6t4 _dta*t ap1t,r f=l arge1c,v Pdtart(a02),+ lfll1a2g82O;f f s| e ^~~~~t ; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h ^~~: 153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ nt wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives7 warnings generated when compiling for gfx900. , 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL7 warnings generated when compiling for gfx906. _STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+llIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_7 warnings generated when compiling for gfx908. ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, daIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ta2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx11007. warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:In file included from 153/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp::141:: In file included from warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hunused variable 'data1' [-Wunused-variable]: 10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable]153 | uint32_ t153 | d a t a 1u,i nftl3a2g_1t, ddaattaa12,, ffllaagg12,; d a| t ^~~~~a 2, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hf:l153a:g212:; warning: unused variable 'flag1' [-Wunused-variable]| ^~~~~ 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153 : 21u:i nwarning: tunused variable 'flag1' [-Wunused-variable]3 2_t d153a | t a 1 , ufilnatg312,_ td adtaat2a,1 ,f lfalga2g;1 , | d ^~~~~a ta2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h,: 153f:l28a:g 2warning: ;unused variable 'data2' [-Wunused-variable] | ^~~~~ 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153u:i28n:t 3warning: 2unused variable 'data2' [-Wunused-variable]_ t da t153a | 1 , f luaign1t,3 2d_att ad2a,t af1l,a gf2l;a g 1| , ^~~~~ dat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ha:2153,: 35f:l awarning: gunused variable 'flag2' [-Wunused-variable]2 ; | 153 ^~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hu:i153n:t353:2 _warning: tunused variable 'flag2' [-Wunused-variable] data 1153, | f l a gu1i,n td3a2t_at2 ,d aftlaa1g,2 ;f l a| ^~~~~g 1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : group(group15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i275d:(90t:i dnote: )in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nthre a275d | s ( n t h r ePardism)i,t itviedsI563, | / * D isrteecptS=*/0,i zPer(ontcoc,l S0h>m epmr.icmosm m .| b ^u ffSizes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_595P:R5O:T Onote: _in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereS IMP L595E | ] / N C CrLu_nSTTrEePeSU/psDiozweno90>:( anote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg s); | ^324 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202P:r53i:m inote: tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei ves< T202, | R e d O p , FRaunnAWsoyrmkmEeltermiecnP,r o/t*oD>i(r)e.crtu=n*(/w0e,) ;P r o| t ^o , 0> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppp:r4i:m1s: note: | in instantiation of member function 'RunWork, 0, 2>::run' requested here ^ 4/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | :I595M:P5L:_ Cnote: Oin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL L_FU N595C | ( A l l RreudnuTcree,e UTpRDEoEw,n > (| a^r gs);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^95 : note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :391 | note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here RunW o202r | k < n c c l F u nRcu#n#WfournkcE,l etmyepnet,< FFnu,n cT#,# dReevdrOepd,o pAP,r oNtCoC>L(_)A.LrGuOn_(#w#ea)l;g o ,| ^N CCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppR:O6T:O1_:# #note: pin instantiation of member function 'RunWork, 0, 2>::run' requested herer oto> (6) | .IrMuPnL(_&CnOcLcLl_SFhUmNeCm(.AwlolrRke)d;u c\e , | T ^R EE, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L15E:, note: Sfield 'nthreads' will be initialized after field 'tidInBlock'u mPost D562i | v , i ntti3d2(_tti)d ) ,| ^n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95(:n tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eads), 391t | i d IRnuBnlWoocrkk(field 'group' will be initialized after field 'stepSize', NCCL_ A562L | G O _ # #taildg(ot,i dN)C,C Ln_tPhRrOeTaOd_s#(#nptrhorteoa>d(s)).,r utni(d&InncBclloSchkm(etmh.rweoardkI)d;x .\x ) ,| ^g roup(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,note: field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]) , nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), g r563o | u p ( g rsotuepp)S,i z e| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n c c| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S hmem .563c | o m m . bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| S group(groupT EPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:T324):)90 :{ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 324 | Pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:i324t:i90v:e snote: , FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT , RedO p324, | F a n A s yPmrmiemtirtiicvt,r i/c*Y >p,r i/m*sD i r| e ^c t=*/0,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :P595r:o5t:o ,note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here0 > pr i595m | s | ^r unTree/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hU:p595D:o5w:n , ProtoSimple<1, 1>>' requested here, Red O595p | , P r ortuonSTirmepelUepT>,( aRregdsO)p;, P| r ^o toSimpl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:<2021:,53 :1 >note: >in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here( args )202; | | ^ Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hW:o202r:k53E:l enote: min instantiation of member function 'RunWorkElement, 0, 2>::run' requested heree nt<(F)n.,r uTn,( wRee)d;O p ,| ^A /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hFUNC(A:l562l:R15e:d uwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]e , TREE, SIMPLE ,562 | S u m P otsitdD(itvi,d )i,n tn3t2h_rte)a d s| (^n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBl o391c | k ( tRhurneWaodrIkdt,e pNSCiCzLe_(AnLcGcOl_S#h#maelmg.oc,o mNmC.CbLu_fPfRSOiTzOe_s#[#NpCrCoLt_oP>R(O)T.Or_uSnI(M&PnLcEc]l/SNhCmCeLm_.SwToErPkS)/;s i\z e o| f ^( T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :{562 : 15| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock'| group(group 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,324 :n90t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds(nthr e324a | d s ) , t iPdrIinmBiltoicvke(st | , / * Dtiirde(ctti=d*)/,0 ,n tPhrroetaod,s (0n>t hprreiamdss ) ,| ^t idInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:(595t:h5r:e anote: din instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereI dx.x) ,595 | g r o u pr(ugnrToruepe)U,p D o| w ^~~~~~~~~~~n >(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, ,NCCL_A LRGeOd_O#p#,a lPgroo,t oNSCiCmLp_lPeR#>p(raortgos>)(;) . r| u ^n (&ncclShmem.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 :\53 : | note: ^in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' Run W562o | r k E l etmiedn(tti(d)I.nrBulno(cwke()t;h r e| a ^d Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp,: 5g:r1o:u pnote: (in instantiation of member function 'RunWork, 0, 2>::run' requested hereg roup )5, | I M| P ^~~~~~~~~~~~~~~~~L _COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562F:U60N:C (note: Afield 'group' will be initialized after field 'stepSize'l lRedu c562e | , T R EtEi,d (StIiMdP)L,E ,n tShurmePaodsst(Dnitvh,r euaidnst)8,_ tt)i d I| n^B lock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d Inote: dexpanded from macro 'IMPL_COLL_FUNC'x .x), g r391o | u p (RgurnoWuopr)k,< n c| c ^~~~~~~~~~~l Func##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nth rea| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h.:x275):,90 :g rnote: oin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu p(group )275, | | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), n,threads(nt hNrCeCaLd_sA)L,G Ot_i#d#IanlBgloo,c kN(CtChLr_ePaRdOITdOx_.#x#)p,r ogtroo>u(p)(.grruonu(p&)n,c c l| S ^~~~~~~~~~~~~~~~~h mem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r60k:) ;note: field 'group' will be initialized after field 'stepSize'\ | ^ 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~i dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::1562:: 15note: :in instantiation of member function 'RunWork, 0, 2>::run' requested here warning: initializer order does not match the declaration order [-Wreorder-ctor] 9 | IMPL_C O562L | L _ F U NtCi(dA(ltliRde)d,u cnet,h rTeRaEdEs,( nStIhMrPeLaEd,s )S,u mtPiodsItnDBilvo,c ku(itnhtr6e4a_dtI)d x .| x^) , gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p391(:g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC') , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 391| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RunWo r563k | < n c c lsFtuenpcS#i#zfeu(nncc,c ltSyhpmee,m .Fcuonmcm#.#bduefvfrSeidzoeps<[tNyCpCeL>_,P RNOCTCOL__SAILMGPOL_E#]#/aNlCgCoL,_ SNTCECPLS_/PsRiOzTeOo_f#(#Tp)r)o t{o > (| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. r u| n group(group( &ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:e275m:.90w:o rnote: kin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) ; \ | ^275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :P562r:i15m:i tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'v esl,o c/k*(Dtihrreecatd=I*d/x0.,x )P,r ogtroo,u p0(>g rporuipm)s, | | ^ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h60::595 :note: 5field 'group' will be initialized after field 'stepSize': note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 562 | 595 | t i dr(utniTdr)e,e UnptDhorwenar>e(aadrIgdsx).;x ) ,| ^g roup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 324 | P r562i | m i t i vteisd<(Tt,i dR)e,d Onpt,h rFeaandAss(ynmtmherteraidcs<)1,, tNiCdCILn_BMlAoXc_kD(EtVh_rAeRaIdTIYd>x,. x/)*,D igrreocutp=(*g/r0o,u pP)r,o t o| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 0 >| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p rims 563| | ^ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:S595i:z5e:( nnote: cin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herec lSh m595e | m . c o mrmu.nbTurfefeSUipzDeosw[nNS>/(sairzgeso)f;( T )| ) ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h group(group: 202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 324202: | 90 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Ru n324W | o r k E l e mPernitmt(r)i.cr:, note: /in instantiation of member function 'RunWork, 0, 2>::run' requested here* Direc t8= | *I/M0P,L _PCrOoLtLo_,F U0N>C (pArlilmRse d u| c ^e , TR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:E595,: 5S:I Mnote: Pin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL E, S595u | m P o s trDuinvT,r eienUtp6D4o_wtn)< T ,| ^R edOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :P391r:o95t:o Snote: iexpanded from macro 'IMPL_COLL_FUNC'm ple<1, 1391> | > ( aRrugnsW)o;r k <| n ^c clFun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:#202#:f53u:n cnote: ,in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here typ e202, | F u n c # # d eRvurneWdoorpkn,t T(O)_.#r#upnr(owteo)>;( ) .| r ^u n(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppc:c9l:S1h:m enote: min instantiation of member function 'RunWork, 0, 2>::run' requested here. wor k9) | ;I M\P L _| C ^O LL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:(562A:l15l:R enote: dfield 'nthreads' will be initialized after field 'tidInBlock'u ce, TR E562E | , S I MtPiLdE(,t iSdu)m,P onstthDrieva,d su(inntth6r4e_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx908. 19 warnings generated when compiling for gfx941. 19 warnings generated when compiling for gfx940. 19 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 19 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 19 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 19 warnings/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp generated: when compiling for 9gfx1100:. 1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for host. 19 warnings generated when compiling for gfx1030. 19 warnings generated when compiling for gfx1101. 19 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ ag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hup), : 562| : ^~~~~~~~~~~15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitivesd,( t/i*dD)i,r enctth=r*e/a0d,s (Pnrtohtroe,a d0s>) ,p rtiimdsI n B| l ^o ck(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a595d:I5d:x .note: xin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here) , gr o595u | p ( g r oruupn)T,r e e| U ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p D o| w tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n m>.(caormgms.)b;u f f| S ^i zes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereS IMPLE ]202/ | N C C L _ S T E PRSu/nsWiozrekoEfl(eTm)e)n t{< F n| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ T ,| group(groupR edOp, A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:g275o:,90 :P rnote: oin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret o>(). r275u | n ( w e ) ; P r| i ^m itives, 0, 2>::run' requested here, FanA s6y | mImMePtLr_iCcOE,, /S*IDMiPrLeEc,t =S*u/m0,, iPnrto3t2o_,t )0 > | p^r ims /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :595:5: note: 391in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here | Ru n595W | o r k < nrcucnlTFruenecU#p#Dfouwnnc<,T ,t yRpeed,O pF,u nPcr#o#tdoeSvirmepdloep<<1t,y p1e>>>,( aNrCgCsL)_;A L G| O ^_ ##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202C:L53_:P Rnote: Oin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereT O_## p202r | o t o > ( ) . r uRnu(n&WnocrcklESlhemmeemn.tw().r u562n | ( w e ) ;t i d| ( ^t id), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpph:r9e:a1d:s (note: nin instantiation of member function 'RunWork, 0, 2>::run' requested heret hre a9d | sI)M,P Lt_iCdOILnLB_lFoUcNkC((tAhlrleRaeddIudcxe.,x )T,R EgEr,o uSpI(MgPrLoEu,p )S,u m ,| ^~~~~~~~~~~~~~~~~u int6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h4:_562t:)60 : | note: ^field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid )391, | n tRhurneWaodrsk(g,r oNuCpC)L,_ A L| G ^~~~~~~~~~~O _##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dwarning: )initializer order does not match the declaration order [-Wreorder-ctor], nthreads(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~e adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 15 :| ^~~~~~~~~~~~~~~~~warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562562 | | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:m562m:.15b:u fwarning: finitializer order does not match the declaration order [-Wreorder-ctor]S izes[NC C562L | _ P R O TtOi_dS(ItMiPdL)E,] /nNtChCrLe_aSdTsE(PnSt/hsriezaedosf)(,T )t)i d{I n B| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o c k| ( group(groupt hreadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u324p:(90g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 324| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563P | r i m i tsitveepsS],/ N/C*CDLi_rSeTcEtP=S*//s0i,z ePorfo(tTo),) 0{> p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i m s| group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h275::59590::5 :note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 275 | 595 | rPurniTmrieteiUvpeDsoM>A(Xa_rDgEsV)_;A R I| T ^Y , 1>, /*D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:r202e:c53t:= *note: /in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here0 , Pr o202t | o , 0 > p r iRmusn W o| r ^k Eleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t595<:F5n:, note: Tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here, Red O595p | , A l grou,n TPrreoetUop>D(o)w.nr, 0, 2>::run' requested here> >(ar g12s | )I;M P L| _ ^C OLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:(202A:l53l:R enote: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereu ce, T202R | E E , S I M P LREu,n WSourmk,E ldeomuebnlte<)F n ,| ^T , Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:p391,: 95A:l gnote: oexpanded from macro 'IMPL_COLL_FUNC', Proto >391( | ) . rRuunn(Wwoer)k;< n c| c ^l Func##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cppf:u8n:c1,: tnote: yin instantiation of member function 'RunWork, 0, 2>::run' requested herep e, F8u | nIcM#P#Ld_eCvOrLeLd_oFpUl,R eNdCuCcLe_,A LTGROE_E#,# aSlIgMoP,L EN,C CSLu_mP,R OiTnOt_6#4#_ptr)o t o| >^( ).ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(391&:n95c:c lnote: Sexpanded from macro 'IMPL_COLL_FUNC'h mem.wo r391k | ) ; R\u n W| o ^r k(,n tNhCrCeLa_dAsL)G,O _t#i#daIlngBol,o cNkC(CtLh_rPeRaOdTIOd_x#.#xp)r,o tgor>o(u)p.(rgurno(u&pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:o562r:k60):; note: \field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dnote: (field 'nthreads' will be initialized after field 'tidInBlock't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s(nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~k (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)60 : note: field 'group' will be initialized after field 'stepSize' 563 | 562 | s t e p Stiizde((tnicdc)l,S hnmtehmr.ecaodmsm(.nbtuhfrfeSaidzse)s,[ NtCiCdLI_nPBRlOoTcOk_(StIhMrPeLaEd]I/dNxC.CxL)_,S TgErPoSu/ps(igzreooufp()T,) ) | { ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep e, Fun c324# | # d e v r e dPorpie,s D(E)V._rAuRnI(T&Yn>c,c l/S*hDmierme.cwto=r*k/)0;, \P r o| t ^o , 0> prims /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595: 5562: | note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here tid (595t | i d ) , rnutnhTrreeaedUsp(Dnotwhnr.>x()a,r ggsr)o;u p (| g ^r oup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~~~~~~~: 202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here: 60: note: field 'group' will be initialized after field 'stepSize'202 | 562 | R u n Wtoirdk(Etliedm)e,n tno(c)k.(rtuhnr(ewaed)I;d x .| x ^) , grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cppp:(13g:r1o:u pnote: )in instantiation of member function 'RunWork, 0, 2>::run' requested here, | ^~~~~~~~~~~13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 : 15| : group(group warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562: | 324 : 90 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid), 324n | t h r e a d sP(rnitmhirteiavdess)<,T ,t iRdeIdnOBpl,o cFka(ntAhsryemamdeItdrxi.cx<)1,, gNrCoCuLp_(MgArXo_uDpE)V,_ A R| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T Y >| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /*Dir e563c | t = * / 0s,t ePprSoitzoe,( n0c>c lpSrhimmesm . c| o ^m m.bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:f595S:i5z:e snote: [in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereN CCL_ P595R | O T O _ SrIuMnPTLrEe]e/UNpCDCoLw_nS>(args)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h;: 324 :| 90 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :324 | note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here P202r | i m i t i v e s A(R)I.TrYu>n,( w/e*)D;i r e| c ^t =*/0,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp :P6r:o1t:o ,note: in instantiation of member function 'RunWork, 0, 2>::run' requested here0 > pr i6m | sI M P| L ^_ COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:F595U:N5C:( Anote: lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herel Red u595c | e , T RrEuEn,T rSeIeMUPpLDEo,w nP >note: (expanded from macro 'IMPL_COLL_FUNC'a rgs); | 391 ^ | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:r202k:<53n:c cnote: lin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereF unc# #202f | u n c , t y p eR,u nFWuonrck#E#ldeemvernetd, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIroto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tric, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitivesa,d s/(*nDtihrreecatd=s*)/,0 ,t iPdrIontBol,o c0k>( tphrriemasd I d| x ^. x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o595u:p5(:g rnote: oin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereu p), 595| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r unTre e563U | p D o w nsS>i(zaersg[sN)C;C L _| P ^R OTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P202L:E53]:/ Nnote: Cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereC L_ST E202P | S / s i z e o f (RTu)n)W o{r k E| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e m e| n group(groupt , FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereA lgo, P r275o | t o > ( ) . rPurni(mwiet)i;v e s| < ^T , RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppp:,8 :F1a:n Anote: sin instantiation of member function 'RunWork, 0, 2>::run' requested herey mmet r8i | cIe,, /T*RDEiEr,e cStI=M*P/L0E,, PPrrootdo,, i0n>t 6p4r_itm)s | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::595391::595:: note: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereexpanded from macro 'IMPL_COLL_FUNC' 595 | 391 | r uRnuTnrWeoerUkpr>e(daorpg| , ^ NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:G202O:_53#:# anote: lin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereg o, N C202C | L _ P R O T O _ #R#upnrWootrok>E(l)e.mreunnt(<&Fnnc,c lTS,h mReemd.Owpo,r kA)l;g o\, P| r ^o to>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:r562u:n15(:w enote: )field 'nthreads' will be initialized after field 'tidInBlock'; | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp : 9t:i1d:( tnote: iin instantiation of member function 'RunWork, 0, 2>::run' requested hered ), n t9h | rIeMaPdLs_(CnOtLhLr_eFaUdNsC)(,A ltliRdeIdnuBcleo,c kT(RtEhEr,e aSdIIMdPxL.Ex,) ,P rgordo,u pu(ignrto6u4p_)t,) | | ^~~~~~~~~~~~~~~~~^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::6095:: note: note: field 'group' will be initialized after field 'stepSize'expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | tRiudn(Wtoirdk)<,n cnctlhFruenacd#s#(fnutnhcr,e atdysp)e,, tFiudnIcn#B#ldoecvkr(etdhorpe.,x )N,C CgLr_oAuLpG(Og_r#o#uapl)g,o , | N ^~~~~~~~~~~C CL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds(nthre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~t hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitivest,h r/e*aDdisr(enctth=r*e/a0d,s )P,r ottiod,I n0B>l opcrki(mtsh r e| a ^d Idx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)595,: 5g:r onote: uin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herep (gro u595p | ) , | r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n T| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e eUpDo w563n | < T , RsetdeOppS,i zPer(ontcocSliSmhpmleem<.1c,o m1m>.>b(uafrfgSsi)z;e s [| N ^C CL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_202S:I53M:P Lnote: Ein instantiation of member function 'RunWorkElement, 0, 2>::run' requested here] /NCC L202_ | S T E P S / s i zReuonfW(oTr)k)E l{e m e| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t < F| n group(group, T, RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:,275 :A90l:g onote: ,in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Proto >275( | ) . r u n ( wPer)i;m i t| i ^v es, 0, 2>::run' requested herea nAsy m11m | eItMrPiLc_R,E E/,* DSiIrMePcLtE=,* /P0r,o dP,r oftloo,a t0)> p| r^i ms | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :expanded from macro 'IMPL_COLL_FUNC'595 :5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 391 | 595R | u n W o rrkuy>p(ea>r,g sN)C;C L _| A ^L GO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:g202o:,53 :N Cnote: Cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereL _PRO T202O | _ # # p r o t o >R(u)n.Wrournk(E&lnecmcelnSth (note: )field 'nthreads' will be initialized after field 'tidInBlock'. run(w e562) | ; | ^t id(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp):,11 :n1t:h rnote: ein instantiation of member function 'RunWork, 0, 2>::run' requested herea ds( n11t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ITdRxE.Ex,) ,S IgMrPoLuEp,( gPrrooudp,) ,f l o| a ^~~~~~~~~~~~~~~~~t ) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h60::391 :note: 95field 'group' will be initialized after field 'stepSize': note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t iRdu(ntWiodr)k,< nnctchlrFeuandcs#(#nftuhnrce,a dtsy)p,e ,t iFduInncB#l#odcekv(rtehdroepax,) ,N CgCrLo_uApL(GgOr_o#u#pa)l,g o ,| ^~~~~~~~~~~N CCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :P562r:o15t:o ,warning: initializer order does not match the declaration order [-Wreorder-ctor]0 > prims | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d595(:t5i:d )note: ,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here nthr e595a | d s ( n trhurneTardese)U,p DtoiwdnIg>r(oaurpg)s,) ; | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :563202 | : 53 : note: sin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heret epSi z202e | ( n c c l S h m eRmu.ncWoomrmk.EbluefmfeSnitz_(S)T.ErPuSn/(swiez)e;o f (| T ^) ) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 8 :| 1 group(group: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:C275O:L90L:_ Fnote: Uin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereN C(All R275e | d u c e , TPRrEiEm,i tSiIvMePsLu,n W/o*rDkip ep,r iFmusn c #| # ^d evredo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:<595t:y5p:e >note: ,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here NCCL _595A | L G O _ #r#uanlTgroe,e UNpCDCoLw_nPo(t)o.Sriumnp(l&enm>e(ma.rwgosr)k;) ; | \ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202::56253::15 :note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested herenote: field 'nthreads' will be initialized after field 'tidInBlock' 202 | 562 | t iRdu(ntWiodr)k,E lnetmhernetaa(d)I.drxu.nx()w,e )g;r o u| p ^( group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp : 10| : ^~~~~~~~~~~~~~~~~1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60 :10 | note: Ifield 'group' will be initialized after field 'stepSize'M PL_COL L562_ | F U N C (tAildl(Rteiddu)c,e ,n tThRrEeEa,d sS(InMtPhLrEe,a dPsr)o,d ,t ihdaIlnfB)l o c| k^( thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391I:d95x:. xnote: )expanded from macro 'IMPL_COLL_FUNC', group (391g | r o uRpu)n,W o r| k ^~~~~~~~~~~< ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:c562o:m15m:. bwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]f fSizes[NC C562L | _ P R O TtOi_dS(ItMiPdL)E,] /nNtChCrLe_aSdTsE(PnSt/hsriezaedosf)(,T )t)i d{I n B| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o c k| ( group(groupt hreadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,324 :g90r:o unote: pin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tidIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cppt:i1d: )In file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n10t: hIn file included from r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.he:a167d: s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o60c:k (note: tfield 'group' will be initialized after field 'stepSize'h readId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( nthrea d563s | ) , t isdtIenpBSliozcek((ntchcrleSahdmIedmx..cxo)m,m .gbruofufpS(igzreosu[pN)C,C L _| P ^~~~~~~~~~~R OTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 15 ^~~~~~~~~~~~~~~~~: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: field 'group' will be initialized after field 'stepSize'562 | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, in/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid)x.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o595u:p5(:g rnote: oin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereu p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 595 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) runTreeU p563D | o w n < Ts,t eRpeSdiOzpe,( nPcrcoltSohSmiemmp.lceof>f(Sairzgess)[;N C C| L ^_ PROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h]:/202N:C53C:L _note: Sin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereT EPS/ s202i | z e o f ( T ) ) R{u n W| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r k E| l group(groupe ment, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret o>().ru n324( | w e ) ; | P ^r imiti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cppv:e7s:<1T:, note: Rin instantiation of member function 'RunWork, 0, 2>::run' requested heree dOp, 7F | aInMAPsLy_mCmOeLtLr_iFcUM,P L/E*,D iMraexc,t =u*i/n0t,3 2P_rto)t o ,| ^0 > pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:s391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595: 5391: | note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereR unWo r595k | < n c c lrFuunnTcr#e#efUupnDco,w nt1,> >N(CaCrLg_sA)L;G O _| # ^# algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C202L:_53P:R Onote: Tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereO _## p202r | o t o > ( ) . r uRnu(n&WnocrcklESlhemmeemn.twfield 'nthreads' will be initialized after field 'tidInBlock'( ).run( w562e | ) ; | t ^i d(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cppn:t8h:r1e:a dnote: sin instantiation of member function 'RunWork, 0, 2>::run' requested here( nthr e8a | dIsM)P,L _tCiOdLILn_BFlUoNcCk((AtlhlrReeadduIcdex,. xT)R,E Eg,r oSuIpM(PgLrEo,u pM)a,x , | i ^~~~~~~~~~~~~~~~~n t64_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:)562 : 60| :^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid) ,391 | n t hRruenaWdosr(knr,o uNpC)C,L _ A| L ^~~~~~~~~~~G O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ anAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:)562;: 15 :| ^warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: 562note: | in instantiation of member function 'RunWork, 0, 2>::run' requested here t9i | dI(MtPiLd_)C,O LnLt_hFrUeNaCd(sA(lnltRherdeuacdes,) ,T RtEiEd,I nSBIlMoPcLkE(,t hMraexa,d Iudixn.tx6)4,_ tg)r o u| p^( grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:)391,: 95 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: expanded from macro 'IMPL_COLL_FUNC' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563 | R u n WsotrekpSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p(gr o563u | p ) , s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e p S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)z e(ncc l563S | h m e m .sctoempmS.ibzuef(fnSciczleSsh[mNeCmC.Lc_oPmRmO.TbOu_fSfISMiPzLeEs][/NNCCCCLL__PSRTOETPOS_/SsIiMzPeLoEf](/TN)C)C L{_ S T| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P S /| s group(groupi zeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :{275 : 90| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| group(group 275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 275 : 90 :P rnote: iin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem itive s275< | T , R e d OPpr,i mFiatniAvseysmM,A X/_*DDEiVr_eAcRtI=T*Y/,0 ,1 >P,r o/t*oD,i r0e>c tp=r*i/m0s, P| r ^o to, 0/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>: 595p:r5i:m snote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here | ^ 595 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 595 :r5u:n Tnote: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heree eUpD o595w | n < T , rRuendTOrpe,e UPprDootwonSP>r(oatrogSsi)m;p l e| < ^1 , 1>>(a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g202s:)53;: note: | in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: Rin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereu nWor k202E | l e m e n t < F nR,u nTW,o rRkeEdlOepm,e nAtld(O)p.,r uAnl(gwoe,) ;P r o| t ^o >().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp(:w10e:)1;: note: | in instantiation of member function 'RunWork, 0, 2>::run' requested here ^ 10 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cppM:P11L:_1C:O Lnote: Lin instantiation of member function 'RunWork, 0, 2>::run' requested here_ FUN C11( | AIlMlPRLe_dCuOcLeL,_ FTURNECE(,A lSlIRMePdLuEc,e ,M aTxR,E Eh,a lSfI)M P L| E^, Max,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :f391l:o95a:t )note: expanded from macro 'IMPL_COLL_FUNC' | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h391: | 391 : 95R:u nnote: Wexpanded from macro 'IMPL_COLL_FUNC'o rk#,# dNeCvCrLe_dAoLpGl,g oN,C CNLC_CALL_GPOR_O#T#Oa_l#g#op,r oNtCoC>L(_)P.RrOuTnO(_&#n#cpcrloSthom>e(m)..wrournk()&;n c\c l S| h ^m em.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ note: field 'nthreads' will be initialized after field 'tidInBlock'| ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :t inote: dfield 'nthreads' will be initialized after field 'tidInBlock'( tid), 562n | t h r e atdids((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~o up(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60):, note: field 'group' will be initialized after field 'stepSize'| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 60 : tnote: ifield 'group' will be initialized after field 'stepSize'd (tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~o up(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(BAllolcRke(dtuhcree,a dTIRdExE.,x )S,I MgPrLoE,u pM(agxr,o ufpl)o,a t )| ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I15d:x .warning: xinitializer order does not match the declaration order [-Wreorder-ctor]) , group( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~t id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), 563 | | ^~~~~~~~~~~~~~~~~ s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S60i:z enote: (field 'group' will be initialized after field 'stepSize'n cclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| ^ :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:60: note: 563field 'group' will be initialized after field 'stepSize' | s t562e | p S i z et(indc(ctliSdh)m,e mn.tchormema.dbsu(fnftShirzeeasd[sN)C,C Lt_iPdRIOnTBOl_oScIkM(PtLhEr]e/aNdCICdLx_.SxT)E,P Sg/rsoiuzpe(ogfr(oTu)p)) ,{ | | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 | t324i | d ( t i d ) ,P rnitmhirteiavdess(p,) ,/ * D| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e c| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)= */0, P563r | o t o , s0t>e ppSriizmes( n c| c ^l Shmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h.:c595o:m5m:. bnote: uin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heref fSiz e595s | [ N C C Lr_uPnRTOrTeOe_USpIDMoPwLnE<]T/,N CRCeLd_OSpT,E PPSr/ostiozSeiomfp(lTe)<)1 ,{ 1 >| > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( a r| g group(groups ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h90::202 :note: 53in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 275 | 202 | P r i m iRtuinvWeosrE(V)_.ArRuInT(Yw,e )1;> , | / ^* Direct=/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp*:/40:,1 :P rnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested heret o, 04> | IpMrPiLm_sC O L| L ^_ FUNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hA:l595l:R5e:d unote: cin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heree , TR E595E | , S I MrPuLnET,r eMeiUnp,D oiwnnt<8T_,t )R e d| O^p , Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:o391S:i95m:p lnote: eexpanded from macro 'IMPL_COLL_FUNC'< 1, 1>> (391a | r g sR)u;n W o| r ^k , 0, 2>::run' requested here, typ e202, | F u n c # # d eRvurneWdoorpkn,t T(O)_.#r#upnr(owteo)>;( ) .| r ^u n(&nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppS:h5m:e1m:. wnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested herer k); \5 | I| M ^P L_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562F:U15N:C (note: Afield 'nthreads' will be initialized after field 'tidInBlock'l lRedu c562e | , T R EtE,i dS(ItMiPdL)E,, nMtihnr,e audisn(tn8t_htr)e a d| s^) , ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I391n:B95l:o cnote: kexpanded from macro 'IMPL_COLL_FUNC'( threadId x391. | x ) ,R ugnrWoourpk(),, NnCtChLr_eAaLdGsO(_n#t#harlegaod,s )N,C CtLi_dPIRnOBTlOo_c#k#(ptrhorteoa>d(I)d.xr.uxn)(,& ngcrcoluSph(mgermo.uwpo)r,k ) ;| ^~~~~~~~~~~\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitivesi,d )/,* Dnitrherceta=d*s/(0n,t hPrreoatdos,) ,0 >t ipdrIinmBsl o c| k ^( threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I595d:x5.:x )note: ,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here grou p595( | g r o u pr)u,n T r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e U p| D tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o wn.>c(oamrmg.sb)u;f f S| i ^z es[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:P53R:O Tnote: Oin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here_ SIMP L202E | ] / N C C L _ S TREuPnSW/osrikzEeloefm(eTn)t)< F{n , | T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, R| e group(groupd Op, Algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 324P:r90o:t onote: >in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( ).run (324w | e ) ; | ^P rimitiv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppe:s5<:T1,: Rnote: ein instantiation of member function 'RunWork, 0, 2>::run' requested hered Op, F5a | nIAMsPyLm_mCeOtLrLi_cFI,M P/L*ED,i rMeicnt,= *u/i0n,t 8P_rto)t o ,| ^0 > pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:s391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :391595 | : 5 :R unote: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereW ork <595n | c c l F urnucn#T#rfeuenUcp,D otwynp1,, N1C>C>L(_aArLgGsO)_;# # a| l ^g o, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:P53R:O Tnote: Oin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here_ ##p r202o | t o > ( ) . r u nR(u&nnWcocrlkSEhlmeemme.nwto().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15s:t ewarning: pinitializer order does not match the declaration order [-Wreorder-ctor]S ize(ncclShm e562m | . c o m mt.ibdu(ftfiSdi)z,e sn[tNhCrCeLa_dPsR(OnTtOh_rSeIaMdPsL)E,] /tNiCdCILn_BSlToEcPkS(/tshirzeeaodfI(dTx).)x ){, g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ( group(groupg roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)275 :90: note: 563in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | st e275p | S i z e ( n cPcrliSmhimteimv.ecsoi,z e/o*fD(iTr)e)c t{= * /| 0 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, P| r group(groupo to, 0> prims | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :324:90: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here595 :5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here324 | 595 | P r i mriutniTvreeseA>X(_aDrEgVs_)A;R I T| Y ^> , /*Dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:t202=:*53/:0 ,note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereP roto ,202 | 0 > p r i m s R u| n ^W orkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:e595n:t5<:F nnote: ,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here T, R e595d | O p , ArlugnoT,r ePerUoptDoo>w(n)<.Tr,u nR(ewdeO)p;, P| r ^o toSimple/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp<:15,: 11:> >note: (in instantiation of member function 'RunWork, 0, 2>::run' requested herea rgs )5; | I M| P ^L _COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:(202A:l53l:R enote: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereu ce, TR E202E | , S I M P L E ,R uMniWno,r kuEilnetm8e_ntt)< F n| ,^ T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:d391O:p95,: Anote: lexpanded from macro 'IMPL_COLL_FUNC'g o, Prot o391> | ( ) .RruunnW(owrek)<;n c c| l ^F unc##fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppn:c6,: 1t:y pnote: ein instantiation of member function 'RunWork, 0, 2>::run' requested here, Fun c6# | #IdMePvLr_eCdOoLpL<_tFyUpNeC>(,A lNlCRCeLd_uAcLeG,O _T#R#EaEl,g oS,I MNPCLCEL,_ PMRiOnT,O _i#n#tp3r2o_tt)o > (| )^. run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c391c:l95S:h mnote: eexpanded from macro 'IMPL_COLL_FUNC'm .work); 391\ | | R ^u nWorkt,h rNeCaCdLs_)A,L GtOi_d#I#naBllgooc,k (NtChCrLe_aPdRIOdTxO._x#)#,p rgortoou>p(()g.rrouunp()&,n c c| l ^~~~~~~~~~~~~~~~~S hmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r60k:) ;note: field 'group' will be initialized after field 'stepSize'\ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) o t563o | , 0 > sptreipmSsi z e| ( ^n cclShmem.comm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:z595e:s5[:N Cnote: Cin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL _PROT O595_ | S I M P LrEu]n/TNrCeCeLU_pSDToEwPnS90>:( anote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg s); | 324 ^ | Pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:i202t:i53v:e snote: , 0, 2>::run' requested hereT , Re d202O | p , F a n A s yRmumneWtorrikcEl,g o/,* DPirroetcot>=(*)/.0r,u nP(rwoet)o;, 0| > ^ prims | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp ^: 6:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 595in instantiation of member function 'RunWork, 0, 2>::run' requested here: 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here6 | IM P595L | _ C O L Lr_uFnUTNrCe(eAUlplDRoewdnu2>_(ta)r g s| )^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::202 :note: 53expanded from macro 'IMPL_COLL_FUNC': note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDow/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:,562 :N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]P ROTO_##pro t562o | > ( ) . rtuind((&tnicdc)l,S hnmtehmr.ewaodrsk()n;t h\r e a| d ^s ), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l15o:c knote: (field 'nthreads' will be initialized after field 'tidInBlock't hread I562d | x . x ) ,t igdr(otuipd()gr,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s ( n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads )563, | t i d IsntBelpoScikz(et(hnrcecaldSIhdmxe.mx.)c,o mgmr.obuupf(fgSriozueps)[,N C C| L ^~~~~~~~~~~~~~~~~_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I60M:P Lnote: Efield 'group' will be initialized after field 'stepSize'] /NCCL _562S | T E P S /tsiidz(etoifd()T,) )n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i324d:I90n:B lnote: oin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec k(threa d324I | d x . x ) , Pgrriomuipt(igvreosu, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(group )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562,: | 562 : | 15 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: t warning: i| initializer order does not match the declaration order [-Wreorder-ctor]d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ( tid) ,563 | 562n | t h r es attdeispSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562r:u15n:T rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]e UpDowns>((natrhgrse)a;d s )| , ^ tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:(202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereI dx.x) ,202 | g r o u p ( g r oRuupn)W,o r k| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l e m| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n tm(.)c.ormumn.(bwuef)f;S i z| e ^s [NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppO:T10O:_1S:I Mnote: Pin instantiation of member function 'RunWork, 0, 2>::run' requested hereL E]/N C10C | LI_MSPTLE_PCSO/LsLi_zFeUoNfC((TA)l)l R{e d u| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e , | T group(groupR EE, SIMPLE, Mi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:,324 :h90a:l fnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 324 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : Pnote: rexpanded from macro 'IMPL_COLL_FUNC'i mitive s391< | T , RRuendWOopr,k p,< t/y*pDei>r,e cNtC=C*L/_0A,L GPOr_o#t#oa,l g0o>, pNrCiCmLs_ P R| O ^T O_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:t595o:>5(:) .note: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereu n(& n595c | c l S h mreumn.TwroerekU)p;D o\w n <| T ^, RedOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562P:r15o:t onote: Sfield 'nthreads' will be initialized after field 'tidInBlock'i mple< 1562, | 1 > > (tairdg(st)i;d ) ,| ^n threads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested heres ), t202i | d I n B l o c k (RtuhreadIndWxo.rxk)E,l egmreonutp<(Fgnr,o uTp,) ,R e d| O ^~~~~~~~~~~~~~~~~p , Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:o562,: 60P:r onote: tfield 'group' will be initialized after field 'stepSize'o >().r u562n | ( w e ) ;t i d| ( ^t id), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpph:r9e:a1d:s (note: nin instantiation of member function 'RunWork, 0, 2>::run' requested heret hre a9d | sI)M,P Lt_iCdOILnLB_lFoUcNkC((tAhlrleRaeddIudcxe.,x )T,R EgEr,o uSpI(MgPrLoEu,p )M,i n ,| ^~~~~~~~~~~u int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' dOp, FanAsymmetricl,F u/n*cD#i#rfeucntc=,* /t0y,p eP,r oFtuon,c #0#>d epvrriemdso p <| t ^y pe>, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:l595g:o5,: Nnote: Cin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereC L_PRO T595O | _ # # p rroutnoT>r(e)e.UrpuDno(w&nn>(args); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 562 | 202 | t i d ( t i d ) ,R unntWhorrekaEdlse(mnetnhtrx(.)x.)r,u ng(rwoeu)p;( g r| o ^u p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h1::562 :note: 60in instantiation of member function 'RunWork, 0, 2>::run' requested here: note: field 'group' will be initialized after field 'stepSize' 9 | IMPL _562C | O L L _ FtUiNdC((tAildl)R,e dnutcher,e aTdRsE(En,t hSrIeMaPdLsE),, Mtiind,I nuBilnotc6k4(_tth)r e a| d^I dx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391g:r95o:u pnote: (expanded from macro 'IMPL_COLL_FUNC'g roup) ,391 | | ^~~~~~~~~~~R unWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: note: expanded from macro 'IMPL_COLL_FUNC' :562:15: 391warning: | initializer order does not match the declaration order [-Wreorder-ctor] RunWorkd,s )N,C CtLi_dAILnGBOl_o#c#ka(ltghor,e aNdCICdLx_.PxR)O,T Og_r#o#uppr(ogtroo>u(p)).,r u n| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~& n c| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Shmem .563w | o r k ) ;s t\e p S| i ^z e(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m15.:c onote: mfield 'nthreads' will be initialized after field 'tidInBlock'm .buff S562i | z e s [ NtCiCdL(_tPiRdO)T,O _nStIhMrPeLaEd]s/(NnCtChLr_eSaTdEsP)S,/ stiizdeIonfB(lTo)c)k ({t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d I| d group(groupx .x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~~~~~~~: 275:90:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 60: note: field 'group' will be initialized after field 'stepSize' 275 | 562 | P r i mtiitdi(vteisd<)T,, nRtehdrOepa,d sF(anntAhsryemamdest)r,i ct,, g/r*oDuipr(egcrto=u*p/)0,, P| r ^~~~~~~~~~~o to, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562):,15 :t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]I nBlock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ), tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s [ N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_PRO T563O | _ S I M PsLtEe]p/SNiCzCeL(_nScTcElPSSh/mseimz.ecoofm(mT.)b)u f{f S i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e s [| N group(groupC CL_PROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:L324E:]90/:N Cnote: Cin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL _STEPS /324s | i z e o f ( TP)r)i m{i t i| v ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e s <| T group(group, RedOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :F275a:n90A:s ynote: min instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem etric< 1275, | N C C L _ MPArXi_mDiEtVi_vAeRsI ,R e/d*ODpi,r eFcatn=A*s/y0m,m ePtrroitco<,N C0C>L _pMrAiXm_sD E V| _ ^A RITY, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h1:>595,: 5/:* Dnote: iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herer ect=* /5950 | , P r ortuon,T r0e>e UpprDiomwsn < T| , ^ RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:,595 :P5r:o tnote: oin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereS imp l595e | < 1 , 1r>u>n(TarregesU)p;D o w| n ^< T, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:p202,: 53P:r onote: tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereo Simpl e202< | 1 , 1 > > ( a rRgusn)W;o r k| E ^l ement, 0, 2>::run' requested heree dOp, 202A | l g o , P r o tRou>n(W)o.rrkuEnl(ewmee)n;t < F| n ^, T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppe:d12O:p1,: Anote: lin instantiation of member function 'RunWork, 0, 2>::run' requested hereg o, P r12o | tIoM>P(L)_.CrOuLnL(_wFeU)N;C ( A| l ^l Reduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppe:,11 :T1R:E Enote: ,in instantiation of member function 'RunWork, 0, 2>::run' requested here SIM P11L | EI,M PMLi_nC,O LdLo_uFbUlNeC)( A l| l^R educe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391T:R95E:E ,note: expanded from macro 'IMPL_COLL_FUNC'S IMPLE, M i391n | , fRluonaWto)r k <| n^c clF/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n391c:#95#:f unote: nexpanded from macro 'IMPL_COLL_FUNC'c , type ,391 | F u nRcu#n#Wdoervkr#,f uNnCcC,L _tAyLpGeO,_ #F#uanlcg#o#,d eNvCrCeLd_oPpR#,p rNoCtCoL>_(A)L.GrOu_n#(#&anlcgcol,S hNmCeCmL._wPoRrOkT)O;_ #\# p r| o ^t o>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15&:n cnote: cfield 'nthreads' will be initialized after field 'tidInBlock'l Shmem. w562o | r k ) ; t\i d (| t ^i d), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock'( nthre a562d | s ) , ttiiddI(ntBildo), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ */0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Di r562e | c t = * /t0i,d (Ptriodt)o,, n0t>h rperaidmss( n t| h ^r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,595 :t5i:d Inote: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereB lock (595t | h r e a drIudnxT.rxe)e,U pgDroowunp<(Tg,r oRuepd)O,p , | P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o t| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S imple <5631 | , 1 > >s(taerpgSsi)z;e ( n| c ^c lShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:.202c:o53m:m .note: bin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereu ffS i202z | e s [ N C C L _ PRRuOnTWOo_rSkIEMlPeLmEe]n/tN ( )| . group(groupr un(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp :note: 11in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 324 | 11 | I MPPrLi_mCiOtLiLv_eFsU,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :/391*:D95i:r enote: cexpanded from macro 'IMPL_COLL_FUNC't =*/0, P391r | o t oR,u n0W>o rpkr, ProtoSimple<1, 1>>' requested here Func #595# | d e v r erduonpTD,o wNnC#>#(parrogtso)>;( ) .| r ^u n(&ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:S202h:m53e:m .note: win instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereo rk); 202\ | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:u562n:W15o:r knote: Efield 'nthreads' will be initialized after field 'tidInBlock'l ement <562F | n , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(on>t(h)r.eraudns()w,e )t;i d I| n ^B lock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpph:r10e:a1d:I dnote: xin instantiation of member function 'RunWork, 0, 2>::run' requested here. x), 10g | rIoMuPpL(_gCrOoLuLp_)F,U N C| ( ^~~~~~~~~~~~~~~~~A llR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c60e:, note: Tfield 'group' will be initialized after field 'stepSize'R EE, S562I | M P L E ,t iMdi(nt,i dh)a,l fn)t h r| e^a ds(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: )expanded from macro 'IMPL_COLL_FUNC', tidIn B391l | o c kR(utnhWroerakd, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d (NtCiCdL)_,A LnGtOh_r#e#aadlsg(on,t hNrCeCaLd_sP)R,O TtOi_d#I#npBrlootcok>((t)h.rreuand(I&dnxc.cxl)S,h mgermo.uwpo(rgkr)o;u p\) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s t etpiSdi(ztei(dn)c,c lnSthhmreema.dcso(mnmt.hbruefafdSsi)z,e st[iNdCICnLB_lPoRcOkT(Ot_hSrIeMaPdLIEd]x/.NxC)C,L _gSrToEuPpS(/gsriozuepo)f,( T )| ) ^~~~~~~~~~~~~~~~~ { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 : 60| : group(group note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,275 :n90t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds(nthr e275a | d s ) , t iPdrIinmBiltoicvke(st, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562 :| 15 ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 53562: | note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here tid( t202i | d ) , n t h r eRaudnsW(onrtkhErleeamdesn)t,< Ftni,d ITn,B lRoecdkO(pt,h rAelagdoI,d xP.rxo)t,o >g(r)o.urpu(ng(rwoeu)p;) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp: 11563: | 1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested heres tepS i11z | eI(MnPcLc_lCSOhLmLe_mF.UcNoCm(mA.lbluRfefdSuiczee,s [TNRCECEL,_ PSRIOMTPOL_ES,I MMPiLnE,] /fNlCoCaLt_)S T E| P^S /size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:f391(:T95):) note: {expanded from macro 'IMPL_COLL_FUNC' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 391 | RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #func ,324 | t y p e , FPurnicm#i#tdievverse,, FNaCnCALs_yAmLmGeOt_r#i#ca, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp::13562::115:: note: warning: in instantiation of member function 'RunWork, 0, 2>::run' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 13 | IMPL_C O562L | L _ F U NtCi(dA(ltliRde)d,u cnet,h rTeRaEdEs,( nStIhMrPeLaEd,s )M,i nt,i drIcncBll_obcfkl(otahtr1e6a)d I d| x^. x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r391o:u95p:( gnote: rexpanded from macro 'IMPL_COLL_FUNC'o up), 391| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Work< n563c | c l F u nsct#e#pfSuinzce,( ntcycpleS,h mFeumn.cc#o#mdme.vbruefdfoSpiC,C LN_CPCRLO_TAOL_GSOI_M#P#LaEl]g/oN,C CNLC_CSLT_EPPRSO/TsOi_z#e#opfr(oTt)o)> ({) . r| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n ( &| n group(groupc clShmem.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):;275 :\90 : | note: ^in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :275562 | : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' Primit i562v | e s < T ,t iRde(dtOipd,) ,F anntAhsryemamdest(rnitche,a d/I*dDxi.rxe)c,t =g*r/o0u,p (Pgrrootuop,) ,0 > | p ^~~~~~~~~~~~~~~~~r ims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^60 : note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595 :5625 | : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here tid( t595i | d ) , nrtuhnrTeraedesU(pnDtohwrnex>)(,a rggrso)u;p ( g| r ^o up), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIhreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pwarning: )initializer order does not match the declaration order [-Wreorder-ctor], | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here In file included from 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp | : 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 10 : In file included from R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hu:n167W: o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562E:l15e:m ewarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t n(t)h.rreuand(sw(en)t;h r e| a ^d s), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cppn:B5l:o1c:k (note: tin instantiation of member function 'RunWork, 1, 2>::run' requested hereh read I5d | xI.MxP)L,_ CgOrLoLu_pF(UgNrCo(uApl)l,R e d| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c e ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R ING, 563S | I M P L Es,t eSpuSmiPzoes(tnDcicvl,S humienmt.8c_otm)m . b| u^f fSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h[:N391C:C95L:_ Pnote: Rexpanded from macro 'IMPL_COLL_FUNC'O TO_SIMP L391E | ] / NRCuCnLW_oSrTkE:, note: Nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereC CL_ A68L | G O _ # #Parligmoi,t iNvCeCsL<_TP,R ORTeOd_O#p#,p rFoatnoS>y(m)m.erturni(c&c,l S0h,m ePmr.owtoor,k )0;> \p r i| m ^s | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::588 note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:A15L:G Onote: _field 'nthreads' will be initialized after field 'tidInBlock'# #algo, N562C | C L _ P RtOiTdO(_t#i#dp)r,o tnot>h(r)e.ardusn((n&tnhcrcelaSdhsm)e,m .twiodrIkn)B;l o\c k (| t ^h readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~~~~~~~t id),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBlpo)c,k ( t| h ^~~~~~~~~~~~~~~~~r eadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x60):, note: gfield 'group' will be initialized after field 'stepSize'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 13 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flaIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hg1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ :514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 7 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp(grou:p562):,15 : | warning: initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. x )| , ^ group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppg:r6o:u1p:) ,note: in instantiation of member function 'RunWork, 0, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 6| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I MPL_CO L563L | _ F U N Cs(tAelplSRiezdeu(cnec,c lTSRhEmEe,m .ScIoMmPmL.Eb,u fPfrSeiMzuelsS[uNmC,C Li_nPtR3O2T_Ot_)S I M| P^L E]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C391L:_95STEPS/sizeof(:T )note: )expanded from macro 'IMPL_COLL_FUNC' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 | | group(group RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref unc, t y324p | e , F u n cP#r#idmeivtrievdeosp<d,O pN,C CFLa_nAALsGyOm_m#e#tarligco<,1 ,N CNCCLC_LP_RMOATXO__D#E#Vp_rAoRtIoT>Y(>),. r/u*nD(i&rneccctl=S*h/m0e,m .Pwroortko),; 0\> p| r ^i ms | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'595 :5: note: 562in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here | t595i | d ( t i dr)u,n TnrteherUepaDdosw(nnI>d(xa.rxg)s,) ;g r o| u ^p (group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 : 53| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :20260 | : note: field 'group' will be initialized after field 'stepSize' R562u | n W o r ktEilde(mteindt)<,F nn,t hTr,e aRdesd(Onpt,h rAelagdos,) ,P rtoitdoI>n(B)l.orcukn((twher)e;a d I| d ^x .x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppo:u8p:(1g:r onote: uin instantiation of member function 'RunWork, 0, 2>::run' requested herep ), | 8 ^~~~~~~~~~~ | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 15 group(group: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562: | 324 : 90 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid) ,324 | n t h r e a dPsr(inmtihtrievaedss<)T,, tRieddIOnpB,l oFcakn(AtshyrmemaedtIrdixc. ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/ *Direct =563* | / 0 , PsrtoetpoS,i z0e>( npcrcilmSsh m e| m ^. comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hb:u595f:f5S:i znote: ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heres [NC C595L | _ P R O TrOu_nSTIrMePeLUEp]D/oNwCnC group(group> (args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here53 : note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 324 | 202 | P r i mRiutniWvoersk().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIs(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkn,t hNrCeCaLd_sA(LnGtOh_r#e#aadlsg)o,, tNiCdCILn_BPlRoOcTkO(_t#h#rperaodtIod>x(.)x.)r,u ng(r&onucpc(lgSrhomuepm).,w o r| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ; | \ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :s15t:e pnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'i ze(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562324::6090:: note: note: field 'group' will be initialized after field 'stepSize'in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 324 | t i d (Ptriidm)i,t invtehsr(,g r/o*uDpi)r,e c t| = ^~~~~~~~~~~* /0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tiIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ d; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h1:,386 :f9l:a gwarning: 1variable 'wireOffset' set but not used [-Wunused-but-set-variable], data2 ,386 | f l a g 2i;n t | w ^~~~~i re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hO:f153f:s28e:t warning: =unused variable 'data2' [-Wunused-variable] Wir e153W | o r d P euriSnlti3c2e_*tw adrapt a+1 ,2 *fwliadg;1 , | d ^a ta2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hTO_##:p562r:o15t:o >warning: (initializer order does not match the declaration order [-Wreorder-ctor]) .run(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock') , tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I n| B tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ock(t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~~~~~~~. buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60s:[ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)68,: 56g:r onote: uin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herep (grou p68) | , | ^~~~~~~~~~~P rimitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:c562c:l15S:h mwarning: einitializer order does not match the declaration order [-Wreorder-ctor]m .comm.bu f562f | S i z e st[iNdC(CtLi_dP)R,O TnOt_hSrIeMaPdLsE(]n/tNhCrCeLa_dSsT)E,P St/isdiIzneBolfo(cTk)()t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I d x| . group(groupx ), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:g68r:o56u:p )note: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 68| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) P r563i | m i t i vsetse, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs), :t562i:d15I:n Bwarning: linitializer order does not match the declaration order [-Wreorder-ctor]o ck(thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~( nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60):, note: tfield 'group' will be initialized after field 'stepSize'i dInBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i dInB l563o | c k ( t hsrteeapdSIidzxe.(xn)c,c lgSrhomuepm(.gcroomump.)b,u f f| S ^~~~~~~~~~~i zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ :153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBloIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size17 warnings generated when compiling for gfx1100. of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562_:C15O:L Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]F UNC(AllReduc e562, | R I N Gt,i dS(ItMiPdL)E,, nStuhmr,e aidnst(3n2t_htr)e a d| s^) , tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:n391B:l95o:c knote: (expanded from macro 'IMPL_COLL_FUNC't hreadId x391. | x ) ,R ugnrWoourpk(e,m .NcCoCmLm_.AbLuGfOf_S#i#zaelsg[oN,C CNLC_CPLR_OPTROO_TSOI_M#P#LpEr]o/tNoC>C(L)_.SrTuEnP(S&/nsciczleSohfm(eTm).)w o{r k )| ; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \ | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hfield 'nthreads' will be initialized after field 'tidInBlock': 68:56: note: 562in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here | t i68d | ( t i d )P,r inmtihtrievaedss<(Tn,t hRreedaOdps,) ,F atniSdyImnmBeltorcikc(r,e a0d,I dPxr.oxt)o,, g0r>o uppr(igmrso u p| ) ^, | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h588::5625::60 :note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested herenote: field 'group' will be initialized after field 'stepSize' 588 | 562 | r u n Rtiindg(t(harregasd)s;) , | t ^i dInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here. x), g202r | o u p ( g r o u pR)u,n W o| r ^~~~~~~~~~~k Element().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | ste RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,, nNtChCrLe_aAdLsG(On_t#h#raelagdos,) ,N CtCiLd_IPnRBOlToOc_k#(#tphrroetaod>I(d)x..rxu)n,( &gnrcoculpS(hgmreomu.pw)o,r k )| ; ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ \ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^ 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :s tnote: efield 'nthreads' will be initialized after field 'tidInBlock'p Size( n562c | c l S h mteimd.(ctoimdm).,b unftfhSriezaedss[(NnCtChLr_ePaRdOsT)O,_ StIiMdPILnEB]l/oNcCkC(Lt_hSrTeEaPdSI/dsxi.zxe)o,f (gTr)o)u p{( g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p )| , group(group | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68 :56256 | : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here tid( t68i | d ) , nPtrhirmeiatdisv(enstd,I d0x,. xP)r,o tgor,o u0p>( gprroiumps) , | ^| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562T:E15P:S /warning: sinitializer order does not match the declaration order [-Wreorder-ctor]i zeof(T) )562 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ( group(groupt id), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r68e:a56d:s (note: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested heret hrea d68s | ) , t iPdrIinmBiltoicvke(st,, 0| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ P r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t o, 0> 563p | r i m s s t| e ^p Size(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:c588c:l5S:h mnote: ein instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested herem .co m588m | . b u f frSuinzReisn[gN](/aNrCgCsL)_;S T E| P ^S /sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:(202T:)53): {note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 202 group(group | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:u68n:W56o:r knote: Ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herel emen t68< | F n , TP,r iRmeidtOipv,e sAF(a)n.Sryumnm(ewter)i;c < 1| > ^, 0, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cppo:t11o:,1 :0 >note: in instantiation of member function 'RunWork, 1, 2>::run' requested herep rims 11 | | I ^M PL_CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:L588_:F5U:N Cnote: (in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereA llR e588d | u c e , rRuInNRGi,n gSt()a r g| s^) ; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: 391in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | Ru n202W | o r k < n c c l FRuunncW#o#rfkuEnlce,m etnytpo,t oN>C(C)L._rAuLnG(Ow_e#)#;a l g| o ^, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cppR:O12T:O1_:# #note: pin instantiation of member function 'RunWork, 1, 2>::run' requested herer oto> (12) | .IrMuPnL(_&CnOcLcLl_SFhUmNeCm(.AwlolrRke)d;u c\e , | R ^I NG, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ,note: field 'nthreads' will be initialized after field 'tidInBlock'S um, d o562u | b l e ) t i| d^( tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391n:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd s(nthr e391a | d s )R,u ntWiodrIkn:,562 :N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'A LGO_# #562a | l g o , tNiCdC(Lt_iPdR)O,T On_t#h#rperaodtso(>n(t)h.rreuand(s&)n,c ctliSdhImneBml.owcokr(kt)h;r e\a d I| d ^x .x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:( gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up), | 562 ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffS Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t da/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ta1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | sIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), t:idIn Bnote: lexpanded from macro 'IMPL_COLL_FUNC'o ck(threadI d391 | Ruxn.Wxo)r,k , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :6562 | :I15MP:L _warning: Cinitializer order does not match the declaration order [-Wreorder-ctor]O LL_FUNC(AllRe d562u | c e , RtIiNdG(,t iSdI)M,P LnEt,h rPeraodds,( nitnhtr3e2a_dts)) ,| ^t idI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B391l:o95c:k (note: texpanded from macro 'IMPL_COLL_FUNC'h readI d391x | . x )R,u ngWroorukp<(ngcrcoluFpu)n,c # #| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n c| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) type, F563u | n c # # dsetverpeSdiozpe<(tnycpcel>S,h mNeCmC.Lc_oAmLmG.Ob_u#f#faSligzoe,s [NNCCCCLL__PPRROOTTOO__#S#IpMrPoLtEo]>/(N)C.CrLu_nS(T&EnPcSc/lsSihzmeeomf.(wTo)r)k ){; \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68 :56256 | : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here tid(ti d68) | , n t hPrreiamdist(invtehsrd,x .0x,) ,P rgortoou,p (0g>r opurpi)m,s | | ^~~~~~~~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 588note: :field 'group' will be initialized after field 'stepSize'5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here562 | 588t | i d ( t irdu)n,R inntghB(laorcgks()t;h r e| a ^d Idx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | :I562M:P15L:_ Cwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]L L_FUNC(Al l562R | e d u c et,i dR(ItNiGd,) ,S InMtPhLrEe,a dPsr(ondt,h rueiandts6)4,_ tt)i d I| n^B lock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d Inote: dexpanded from macro 'IMPL_COLL_FUNC'x .x), g r391o | u p (RgurnoWuopr)k,< n c| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l F u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c ##func ,563 | t y p e ,s tFeupnSci#z#ed(envcrceldSohpmm,m .NbCuCfLf_SAiLzGeOs_[#N#CaClLg_oP,R ONTCOC_LS_IPMRPOLTEO]_/#N#CpCrLo_tSoT>E(P)S./rsuinz(e&onfc(cTl)S)h m{e m .| w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| ) group(group; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h68::56256::15 :note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herenote: field 'nthreads' will be initialized after field 'tidInBlock' 68 | 562 | P rtiimdi(ttiivde)s,< Tn,t hRreedaOdps,( nFtahnrSeyamdmse)t,r itcin,B l0o,c kP(rtohtroe,a d0I>d xp.rxi)m,s g r| o ^u p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p588):,5 : | note: ^~~~~~~~~~~~~~~~~in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :588562 | : 60 : note: rfield 'group' will be initialized after field 'stepSize'u nRing <562T | , R e dtOipd,( tPirdo)t,o >n(tahrrgesa)d;s ( n| t ^h reads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53I:n Bnote: lin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereo ck(t h202r | e a d I d x . x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~T , RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]( threadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~d s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t daIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ta1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hims: 562 :| 15 ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :562588 | : 5 : note: tin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested herei d(tid )588, | n t h rreuandRsi(nngto(cakr(gtsh)r;e a d| I ^d x.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202(:g53r:o unote: pin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 202 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563R | u n W o rsktEelpeSmieznet(C(C)L._rPuRnO(TwOe_)S;I M P| L ^E ]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cppS:T5E:P1S:/ snote: iin instantiation of member function 'RunWork, 1, 2>::run' requested herez eof( T5) | )I M{P L _| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O L L| _ group(groupF UNC(AllReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :R68I:N56G:, note: Sin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereI MPLE ,68 | P r e M uPlrSiummi,t iuvienst<8T_,t )R e d| O^p , Fan/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:y391m:m95e:t rnote: iexpanded from macro 'IMPL_COLL_FUNC'c <1>, 0 ,391 | P r oRtuon,W o0r>k , ProtoSimple<2, 2>>' requested hereF unc #588# | d e v r erduonpR,, RNeCdCOLp_,A LPGrOo_t#o#>a(lagrog,s )N;C C L| _ ^P ROTO_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o202t:o53>:( )note: .in instantiation of member function 'RunWorkElement, 1, 2>::run' requested herer un(& n202c | c l S h m e m . wRournkW)o;r k\E l e| m ^e ntt(i)d.(rtuind()w,e )n;t h r| e ^a ds(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cppe:a5d:s1):, note: tin instantiation of member function 'RunWork, 1, 2>::run' requested herei dInB l5o | cIkM(PLt_hCrOeLaLd_IFdUxN.Cx()A,l lgRreoduupc(eg,r oRuIpN), | ^~~~~~~~~~~~~~~~~ G, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P60L:E ,note: field 'group' will be initialized after field 'stepSize'P reMul S562u | m , u itnitd8(_tti)d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391(:n95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a ds), ti d391I | n B lRoucnkW(otrhkr, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: iinitializer order does not match the declaration order [-Wreorder-ctor] d), nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. x )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) group(group )563, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ s t| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p Size( n563c | c l S h msetme.pcSoimzme.(bnucfcflSSihzmeesm[.NcCoCmLm_.PbRuOfTfOS_iSzIeMsP[LNEC]C/LN_CPCRLO_TSOT_ESPISM/PsLiEz]e/oNfC(CTL)_)S T{E P S| / ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s i z| e group(groupo f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 68in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here: 56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | 68P | r i m i tPirviemsir,i c0<,1 >P,r o0t,o ,P r0o>t op,r i0m>s p r| i ^m s | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 588in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here: 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here588 | 588 | r u n R irnugnr(oatrog>s()a;r g s| ) ^; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53 :202 | note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202R | RuunnWWoorrkkEElleemmeenntt<>(())..rruunn((wwee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp::14:: 1note: :in instantiation of member function 'RunWork, 1, 2>::run' requested here note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | I M4P | LI_MCPOLL_LC_OFLULN_CF(UANlCl(RAeldluRceed,u cReI,N GR,I NSGI,M PSLIEM,P LPEr,e MPurleSMuuml,S uumi,n ti8n_tt8)_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h391::39195::95 :note: expanded from macro 'IMPL_COLL_FUNC'note: expanded from macro 'IMPL_COLL_FUNC' 391 | 391 | R uRnuWnoWrokre,> ,N CNCCLC_LA_LAGLOG_O#_##a#laglog,o ,N CNCCLC_LP_RPORTOOT_O#_##p#rportoot>o(>)(.)r.urnu(n&(n&cncclcSlhSmhemme.mw.owrokr)k;) ;\ \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]h readIdx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~t hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h\: 562 :| 15 ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 562note: | field 'nthreads' will be initialized after field 'tidInBlock' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , | ^~~~~~~~~~~~~~~~~563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e60p:S inote: zfield 'group' will be initialized after field 'stepSize'e (nccl S562h | m e m . ctoimdm(.tbiudf)f,S inztehsr[eNaCdCsL(_nPtRhOrTeOa_dSsI)M,P LtEi]d/INnCBClLo_cSkT(EtPhSr/esaidzIedoxf.(xT)),) g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( g r| o group(groupu p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o60c:k (note: tfield 'group' will be initialized after field 'stepSize'h read I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkEleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElemIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ent().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ :21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :d514a:t9a:1 ,warning: variable 'offset' set but not used [-Wunused-but-set-variable]f lag1 ,514 | d a t a 2i,n tf loafgf2s;e t | = ^~~~~ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hd:;153 : 35| : ^ warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :153514 | : 9 : warning: uvariable 'offset' set but not used [-Wunused-but-set-variable]i nt32_ t514 | d a t a 1i,n tf loafgf1s,e td a=t at2i,d ;f l a| g ^2 ; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uinIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args7 warnings generated when compiling for gfx1102. ->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, daIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ ta2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizeIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkM,P LNEC]C/LN_CACLLG_OS_T#E#PaSl/gsoi,z eNoCfC(LT_)P)R O{T O _| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~# p r| o group(groupt o>().run(&ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:m666.:w9o:r knote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here; \ | ^666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :p15r:i mnote: sfield 'nthreads' will be initialized after field 'tidInBlock'( tid, n T562h | r e a d stGiadt(hteird,) ,d inrtehcrte-a>dusp(,n tNhUrLeLa,d sa)r,g st-i>dsIennBdlboucfkf(,t harregasd-I>drxe.cxv)b,u fgfr,o u p| ( ^g roup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^~~~~~~~~~~~~~~~~202 :53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 60: note: field 'group' will be initialized after field 'stepSize' 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. x )| , ^ group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppr:o5u:p1):, note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| ^~~~~~~~~~~ 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562, :g15r:o uwarning: pinitializer order does not match the declaration order [-Wreorder-ctor]( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 | 563 | t i d ( tsitde)p,S inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBelso[cNkC(CtLh_rPeRaOdTIOd_xS.IxM)P,L Eg]r/oNuCpC(Lg_rSoTuEpP)S,/ s i| z ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e o f| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T )) { | 563 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group stepSize(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:c655c:l11S:h mnote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem .comm .655b | u f f S i z e s [ N CpCrLi_mPsR(OtTiOd_-StIiMdPSLtEa]r/tNRCeCdLu_cSeT,E PnST/hsriezaedosfR(eTd)u)c e{, n| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l l p| t group(groupr , &direct->out, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:g641s:-11>:s enote: nin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered buff, a641r | g s - > r e c v b u fpfr,i m s| ( ^t id-tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:a202r:t53R:e dnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec e, n T202h | r e a d s R e d uRcuen,W odrikrEelcetm-e>ndtoout, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S15h:m ewarning: minitializer order does not match the declaration order [-Wreorder-ctor]. comm.buf f562S | i z e s [tNiCdC(Lt_iPdR)O,T On_tShIrMePaLdEs](/nNtChCrLe_aSdTsE)P,S /tsiidzIenoBfl(oTc)k)( t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I d| x group(group. x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r677o:u11p:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)677 | 563 | psrtiempsS(itzied(-ntcicdlSSthamretmB.ccaosmtm,. bnuTfhfrSeiazdessB[cNaCsCtL,_ P&RdOiTrOe_cStI-M>PoLuEt],/ NdCiCrLe_cStT-E>dPoSw/ns,i zaerogfs(-T>)s)e n{d b u| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f , | group(group args->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hv:b666u:f9f:, note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^ 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ims( t202i | d , n T h r e aRdusnGWaotrhkeErl,e mdeinrte uTp,, RNeUdLOLp,, aArlggso-,> sPernodtbou>f(f),. raurng(sw-e>)r;e c v| b ^u ff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp ^: 6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: 6note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI MPL_ C202O | L L _ F U N C ( ARlulnRWeodrukcEel,e mCeOnLtLt(D)i.vr,u ni(nwte3)2;_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::1391:: 95note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: expanded from macro 'IMPL_COLL_FUNC' 4 | IMP L391_ | C O LRLu_nFWUoNrCk(D,i vN,C CiLn_tA8L_GtO)_ # #| a^l go, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C391C:L95_:P Rnote: Oexpanded from macro 'IMPL_COLL_FUNC'T O_##pr o391t | o > (R)u.nrWuonr(k& | , N C CtLi_dA(LtGiOd_)#,# anltghor,e aNdCsC(Ln_tPhRrOeTaOd_s#)#,p rtoitdoI>n(B)l.orcukn((t&hnrcecaldSIhdmxe.mx.)w,o rgkr)o;u p\( g r| o ^u p), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~: 562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 60: note: field 'group' will be initialized after field 'stepSize' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~: 562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 562 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | t563i | d ( t i ds)t,e pnStihzree(andcsc(lnSthhmreema.dcso)m,m .tbiudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~i zeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N C562C | L _ A L GtOi_d#(#taildg)o,, nNtChCrLe_aPdRsO(TnOt_h#r#epardost)o,> (t)i.drIunnB(l&oncckc(ltShhrmeeamd.Iwdoxr.kx));, \g r o| u ^p (group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 15 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | field 'nthreads' will be initialized after field 'tidInBlock' tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562563 | | tsitde(ptSiidz)e,( nnctchlrSehamdesm(.nctohmrme.abdusf)f,S itziedsI[nNBClCoLc_kP(RtOhTrOe_aSdIIMdPxL.Ex])/,N CgCrLo_uSpT(EgPrSo/uspi)z,e o f| ( ^~~~~~~~~~~~~~~~~T )) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~60 : | note: group(groupfield 'group' will be initialized after field 'stepSize' 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:(687t:i11d:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren thread s687( | n t h r e a d s ) , ptriidmIsn(Btliodc-kt(itdhSrteaardtIBdcxa.sxt),, ngTrhoruepa(dgsrBocuaps)t,, &| d ^~~~~~~~~~~i rect->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 4warning: | initializer order does not match the declaration order [-Wreorder-ctor]I MPL_COLL _562F | U N C ( AtlildR(etdiudc)e,, nCtOhLrLeNaEdTs_(DnItRhErCeTa,d sS)I,M PtLiEd,I nSBulmoPcoks(ttDhirve,a diIndtx8._xt)), g| r^o up(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:)95,: note: | expanded from macro 'IMPL_COLL_FUNC' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563 | R u n WsotrekpI,M PNLCEC]L/_NACLCGLO__S#T#EaPlSg/os,i zNeCoCfL(_TP)R)O T{O _ #| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p r o| t group(groupo >().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h&:n677c:c11l:S hnote: min instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.work) ;677 | \ | ^ p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562m:s15(:t inote: dfield 'nthreads' will be initialized after field 'tidInBlock'- tidSt a562r | t B c a stti,d (ntTihdr)e,a dnstBhcraesatd,s (&ndtihrreecatd-s>)o,u tt,i ddIinrBelcotc-k>(dtohwrne,a daIrdgxs.-x>)s,e ngdrbouufpf(,g raorugps)-,> r e| c ^~~~~~~~~~~~~~~~~v buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 60 ^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :562202 | : 53 : note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d(ti d202) | , n t h r e a dRsu(nnWtohrrkeEaldesm)e,n tto(u)p.(rgurno(uwpe)),; | | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562t:-15>:o uwarning: tinitializer order does not match the declaration order [-Wreorder-ctor], direct -562> | d o w n ,t iadr(gtsi-d>)s,e nndtbhurfefa,d sa(rngtsh-r>eraedcsv)b,u ftfi,d I n| B ^l ock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d202x:.53x:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg roup(g r202o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)W orkE l563e | m e n t f(f)S.irzuens([wNeC)C;L _ P| R ^O TO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp]:/4N:C1C:L _note: Sin instantiation of member function 'RunWork, 2, 2>::run' requested hereT EPS/si z4e | oIfM(PTL)_)C O{L L _| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~U N C| ( group(groupA llReduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 687C:O11L:L Nnote: Ein instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT _DIRE C687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I15n:B lwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]c k(threadId x562. | x, ) , N CgtCriLod_u(AptL(group),iG dO )_| ,# ^~~~~~~~~~~ # natlhgroe,a dNsC(CnLt_hPrReOaTdOs_)#,# ptriodtIon>B(l)o.crku(nt(h&rnecacdlISdhxm.exm).,w ogrrko)u;p (\g r o| u ^p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 563note: | field 'nthreads' will be initialized after field 'tidInBlock' stepS i562z | e ( n c ctliSdh(mteimd.)c,o mnmt.hbruefafdSsi(znetsh[rNeCaCdLs_)P,R OtTiOd_ISnIBMlPoLcEk](/tNhCrCeLa_dSITdExP.Sx/)s,i zgeroofu(pT()g)r o{u p )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 666note: :field 'group' will be initialized after field 'stepSize'9 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 666 | t i d ( t i dp)r,i mnst(htrieda,d sn(TnhtrheraedasdGsa)t,h etri,d IdniBrleocctk-(>tuhpr,e aNdUILdLx,. xa)r,g sg-r>osuepn(dgbruofufp,) ,a r g| s ^~~~~~~~~~~- >recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thrumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):)562 :{15 : | warning: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~initializer order does not match the declaration order [-Wreorder-ctor] | group(group 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:)666,: 9n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads(n t666h | r e a d s ) , tpirdiImnsB(ltoicdk,( tnhTrheraedaIddsxG.axt)h,e rg,r oduipr(egcrto-u>pu)p,, N| U ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L L ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rgs- >563s | e n d b usftfe,p Sairzges(-n>crcelcSvhbmuefmf.,c o m| m ^. buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:[202N:C53C:L _note: Pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR OTO_ S202I | M P L E ] / N C CRLu_nSWToErPkSE/lseimzeenotf<(FTn),) T{, R| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d O p| , group(group Algo, Proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:)666.:r9u:n (note: win instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ); | ^666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp : 5 :p1r:i mnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here( tid, 5n | TIhMrPeLa_dCsOGLaLt_hFeUrN,C (dAilrleRcetd-u>cuep,, CNOULLLLN,E Ta_rDgIsR-E>CsTe,n dSbIuMfPfL,E ,a rSgusm-P>orsetcDvibvu,f fu,i n t| 8 ^_ t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::53391:: 95note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: expanded from macro 'IMPL_COLL_FUNC' 202 | 391 | RunWork,, PNrCoCtLo_>A(L)G.Or_u#n#(awleg)o;, N| C ^C L_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp_:#5#:p1r:o tnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested here> ().ru n5( | &InMcPcLl_SChOmLeLm_.FwUoNrCk()A;l l\R e d| u ^c e, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:N562E:T15_:D Inote: Rfield 'nthreads' will be initialized after field 'tidInBlock'E CT, S562I | M P L E ,t iSdu(mtPiods)t,D invt,h rueiandts8(_ntt)h r e| a^d s), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:I95n:B lnote: oexpanded from macro 'IMPL_COLL_FUNC'c k(thre a391d | I d xR.uxn)W,o rgkrt,i dN)C,C Ln_tAhLrGeOa_d#s#(anltghor,e aNdCsC)L,_ PtRiOdTIOn_B#l#opcrko(ttoh>r(e)a.drIudnx(.&xn)c,c lgSrhomuepm(.gwroorukp));, \ | ^~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n thre a563d | s ( n t hsrteeapdSsi)z,e (tnicdcIlnSBhlmoecmk.(ctohmrme.abduIfdfxS.ixz)e,s [gNrCoCuLp_(PgRrOoTuOp_)S,I M P| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E ] /| N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_S T563E | P S / s iszteeopfS(iTz)e)( n{c c l| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h m e| m group(group. comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h[:N687C:C11L:_ Pnote: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO TO_SI M687P | L E ] / N C C L _ S TpErPiSm/ss(itziedo-ft(iTd)S)t a{r t B| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a s t| , group(group nThreadsBcas/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:,666 :&9d:i rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec t->o u666t | , n u l l p t rp,r iamrsg(st-i>ds,e nndTbhurfefa,d saGragtsh-e>rr,e cdvibruefcft,- > u| p ^, NULL, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r202g:s53-:> snote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren dbuf f202, | a r g s - > r eRcuvnbWuofrfk,E l e| m ^e nt, 2, 2>::run' requested here, Alg o202, | P r o t o > ( )R.urnuWno(rwkeE)l;e m e| n ^t , 2, 2>::run' requested here Algo ,4 | PIrMoPtLo_>C(O)L.Lr_FUNC(AllReudnu(cwee,) ;C O L| L ^N ET_DIRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppC:T7,: 1S:I Mnote: Pin instantiation of member function 'RunWork, 2, 2>::run' requested hereL E, S u7m | PIoMsPtLD_iCvO,L Li_nFtU8N_Ct()A l l| R^e duce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391C:O95L:L Nnote: Eexpanded from macro 'IMPL_COLL_FUNC'T _DIRE C391T | , SRIuMnPWLoEr,k ,391 | N C CRLu_nAWLoGrOk_<#n#cacllgFou,n cN#C#CfLu_nPcR,O TtOy_p#e#,p rFoutnoc>#(#)d.ervurne(d&onpce,m .NwCoCrLk_)A;L G\O _ #| # ^a lgo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k60(:t hnote: rfield 'group' will be initialized after field 'stepSize'e adId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~d s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:s15e:n dwarning: binitializer order does not match the declaration order [-Wreorder-ctor]u ff, args -562> | r e c v btuifdf(,t i d| ) ^, nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ), ti d202I | n B l o c k ( t hRrunWorkElement().run(ewaed)I;d x .| x ^) , group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppg:r5o:u1p:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)5 | IMPL _563C | O L L _ FsUtNeCp(SAilzleR(endcuccleS,h mCeOmL.LcNoEmTm_.DbIuRfEfCSTi,z eSsI[MNPCLCEL,_ PSRuOmTPOo_sStIDMiPvL,E ]u/iNnCtC8L__tS)T E P| S^/ sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:(391T:)95): {note: expanded from macro 'IMPL_COLL_FUNC' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 391 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:r677k:<11n:c cnote: lin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF unc## f677u | n c , t y p e , Fpurnicm#s#(dteivdr-etdiodpSc,a sNtC,C Ln_TAhLrGeOa_d#s#Baclagsot,, N&CdCiLr_ePcRtO-T>Oo_u#t#,p rdoitroe>c(t)-.>rduonw(n&,n cacrlgSsh-m>esme.nwdobrukf)f;, \a r g| s ^- >recvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:,15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered (tid )202, | n t h r e a d sR(unntWhorrekaEdlse)m,e nttiu(p)(.grruonu(pw)e,) ; | ^~~~~~~~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp::607:: 1note: :field 'group' will be initialized after field 'stepSize' note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 7 | I M PtLi_dC(OtLiLd_)F,U NnCt(hArlelaRdesd(uncteh,r eCaOdLsL)N,E Tt_iDdIIRnEBClTo,c kS(ItMhPrLeEa,d ISduxm.Pxo)s,t Dgirvo,u pu(ignrto3u2p_)t,) | | ^~~~~~~~~~~^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~d x.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u60p:) ,note: field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:,15 :S uwarning: minitializer order does not match the declaration order [-Wreorder-ctor]P ostDiv, u562i | n t 8 _ tt)i d (| t^i d), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'( nthre a391d | s ) ,R utniWdoIrnkB, N563C | C L _ A LsGtOe_p#S#iazleg(on,c cNlCSChLm_ePmR.OcToOm_m#.#bpurfoftSoi>z(e)s.[rNuCnC(L&_nPcRcOlTSOh_mSeImM.PwLoEr]k/)N;C C\L _ S| T ^E PS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:o562f:(15T:) )note: field 'nthreads' will be initialized after field 'tidInBlock'{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562| | group(group tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n687t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (nthr e687a | d s ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,B cgarsotu,p (ngTrho/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h)562, | : 562 | : ^~~~~~~~~~~ t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g roup (563g | r o u p )s,t e p| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i z e| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n cclS h563m | e m . c osmtme.pbSuifzfeS(inzcecsl[SNhCmCeLm_.PcRoOmTmO._bSuIfMfPSLiEzereadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15: warning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 563| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ste p563S | i z e ( nsctcelpSShimzeem(.nccocmlmS.hbmuefmf.Sciozmems.[bNuCfCfLS_iPzReOsT[ON_CSCILM_PPLREO]T/ON_CSCILM_PSLTEE]P/SN/CsCiLz_eSoTfE(PTS)/)s i{z e o| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( T )| ) group(group { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 687655: | 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pr i687m | s ( t i d - t i d S tparritmRse(dtuicde-,t indTShtraeratdBscRaesdtu,c en,T hnruelaldpstBrc,a s&td,i r&edcitr-e>cotu-t>,o uatr,g sn-u>lslepntdrb,u fafr,g sa-r>gsse-n>drbeucfvfb,u fafr,g s -| > ^r ecvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,202 : 53| : ^ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :20253 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here R u202n | W o r k E l e m eRnutnA(l)g.or,u nP(rwoet)o;> ( )| . ^r un(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppe:)5;: 1 :| ^note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppI:M6P:L1_:C Onote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _FUNC (6A | lIlMRPeLd_uCcOeL,L _CFOULNLCN(EATl_lDRIeRdEuCcTe,, SCIOMLPLLNEE,T _SDuImRPEoCsTt,D iSvI,M PuLiE,n tS8u_mtP)o s t| D^i v, i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t3913::295562_:: 15note: :expanded from macro 'IMPL_COLL_FUNC' warning: initializer order does not match the declaration order [-Wreorder-ctor] t) 391 | | ^ RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h k:562<391 | n :c 95c :l Ftnote: uexpanded from macro 'IMPL_COLL_FUNC'ind c(#t#ifdu)n ,c391 , | n tt hyRrpueena,Wd osFr(uknnto,yc pkNe(C,tC hLFr_euAanLdcGI#Od#_xd#.e#xva)rl,eg dogo,rp oPo,Ru OpNT)CO,C_ L# _#| Ap ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Lr Go Ot| _o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)#> #(a)l.gr ou563,n | ( N& Cn Cc Lcs_ltPSeRhpOmSTeiOmz_.e#w(#onprcrkco)lt;So h>\m( e) m.| .r ^cu onm(m&./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnb:cu562cf:lf15SS:hi mznote: eefield 'nthreads' will be initialized after field 'tidInBlock'ms .[wNoCr Ck562L) | _; P R\ O T tO| i_ ^dS (ItMiPdL)E/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,]: /562nN:tC15hC:rL e_note: aSfield 'nthreads' will be initialized after field 'tidInBlock'dT sE(PnSt/ hs562ri | ez ae do sf )(t,Ti )dt)(i td{iI dn )B| ,l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ o nc tk| h( group(grouprt heraedasd(Indtxh.rxe)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha,:d 641sg:)r11,o: u tpnote: i(in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heredg IrnoBulpo )c641,k | ( t | h ^~~~~~~~~~~~~~~~~r e a d /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI :d 562x :.p60xr:)i ,mnote: sfield 'group' will be initialized after field 'stepSize'g( rtoiudp -(562tg | ir do Su tp a)tr,it dR (e| td ^~~~~~~~~~~~~~~~~iu dc)e,, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h n:nt562Th:hr60re:ea adnote: dsfield 'group' will be initialized after field 'stepSize's( Rnetdhu rc562ee | a, d sd )i ,rt eitcdit(d-tI>inddBo)lw,on c,nk t(&htdrhierraeedacsdt-(I>ndotxuh.trx,e) a,ad rsgg)rs,o- u>tpsi(edgnIrdnobBuulpfo)fc,,k ( at| rh ^~~~~~~~~~~gr se-a>drIedcxv.bxu)f,f ,g r o| u ^p (group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^~~~~~~~~~~: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'RunWork, 2, 2>::run' requested here: 562:15: 6warning: | initializer order does not match the declaration order [-Wreorder-ctor]I MPL_COLL_ F562U | N C ( A ltliRde(dtuicde),, CnOtLhLrNeEaTd_sD(InRtEhCrTe,a dSsI)M,P LtEi,d ISnuBmlPoocskt(Dtihvr,e aidnItd3x2._xt)), g| r^o up(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95):, note: expanded from macro 'IMPL_COLL_FUNC'| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | R563u | n W o r ksT,O _NSCICMLP_LAEL]G/ON_C#C#La_lSgToE,P SN/CsCiLz_ePoRfO(TTO)_)# #{p r o| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o > (| ) group(group. run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l626S:h9m:e mnote: .in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herew ork); \626 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :p562r:i15m:s (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d-tid S562t | a r t S ctaitdt(etri,d )n,T hnrtehardesaSdcsa(tnttehrr,e aNdUsL)L,, tdiidrIencBtl-o>cukp(,t harregasd-I>dsxe.nxd)b,u fgfr,o uapr(ggsr-o>urpe)c,v b u| f ^~~~~~~~~~~~~~~~~f , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hfield 'group' will be initialized after field 'stepSize': 202:53: note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | t202i | d ( t i d ) , nRtuhnrWeoardksE(lnetmhernetax()),. rgurno(uwpe()g;r o u| p ^) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :n15u:l lwarning: pinitializer order does not match the declaration order [-Wreorder-ctor]t r, &dire c562t | - > o u tt,i da(rtgisd-)>,s enntdhbruefafd,s (anrtghsr-e>ardesc)v,b utfifd,I n B| l ^o ck(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:I53d:x .note: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , gr o202u | p ( g r o u p ) ,R u n| W ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o r k| E tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ement< F563n | , T , sRteedpOSpi,z eA(lngcoc,l SPhrmoetmo.>c(o)m.mrun(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here .bu f7f | SIiMzPeLs_[CNOCLCLL__FPURNOCT(OA_lSlIRMePdLuEc]e/,N CCCOLL_LSNTEETP_SD/IsRiEzCeTo,f (STI)M)P L{E , | S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u m P| o group(groups tDiv, ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t6773:211_:t )note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h677: | 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' pri m391s | ( t iRdu-ntWiodrSktdooupt<,t ydpier>e,c tN-C>CdLo_wAnL,G Oa_r#g#sa-l>gsoe,n dNbCuCfLf_,P RaOrTgOs_-#>#rpercovtbou>f(f),. r u| n ^( &ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:e202m:.53w:o rnote: kin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) ; \ 202| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :R15u:n Wnote: ofield 'nthreads' will be initialized after field 'tidInBlock'r kElem e562n | t < F n ,t iTd,( tRiedd)O,p ,n tAhlrgeoa,d sP(rnotthor>e(a)d.sr)u,n (twied)I;n B l| o ^c k(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppd:I7d:x1.:x )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here group (7g | rIoMuPpL)_,C O L| L ^~~~~~~~~~~~~~~~~_ FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:(562A:l60l:R enote: dfield 'group' will be initialized after field 'stepSize'u ce, C O562L | L N E T _tDiIdR(EtCiTd,) ,S InMtPhLrEe,a dSsu(mnPtohsrteDaidvs,) ,u itnitd3I2n_Btl)o c k| (^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup(gr o391u | p ) ,R u n| W ^~~~~~~~~~~o rk, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p15S:i zwarning: einitializer order does not match the declaration order [-Wreorder-ctor]( ncclShmem.c o562m | m . b u ftfiSdi(zteisd[)N,C CnLt_hPrReOaTdOs_(SnItMhPrLeEa]d/sN)C,C Lt_iSdTIEnPBSl/oscikz(etohfr(eTa)d)I d{x . x| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, g| r group(groupo up(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:)641,: 11 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 641 | 563 | s t e ppSriizme(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g r o u ps(tgerpoSuipz)e,( n c| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l S h| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e m.com m563. | b u f f SsitzeepsS[iNzCeC(Ln_cPcRlOSThOm_eSmI.McPoLmEm]./bNuCfCfLS_iSzTeEsP[SN/CsCiLz_ePoRfO(TTO)_)S I{M P L| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~] / N| C group(groupC L_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 626 group(group: 9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 626note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641p | r i m s ( t i d - t ipdrSitmasr(ttSicda-tttiedrS,t anrTthRreedaudcseS,c antTtherre,a dNsURLeLd,u cdei,r edcitr-e>cutp-,> daorwgns,- >&sdeinrdebcutf-f>,o uatr,g sa-r>grse-c>vsbeunfdfb,u f f| , ^ args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:v202b:u53f:f ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : Rnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Work E202l | e m e n t < F n ,R uTn,W oRrekdEOlpe,m eAnltge(d)O.pr,u nA(lwgeo),; P r| o ^t o>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppu:n6(:w1e:) ;note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^ 6 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppP:L6_:C1O:L Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereF UNC(A l6l | RIeMdPuLc_eC,O LCLO_LFLUNNECT(_ADlIlRREeCdTu,c eS,I MCPOLLEL,N ESTu_mDPIoRsEtCDTi,v ,S IiMnPtL3E2,_ tS)u m P| o^s tDiv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391i:n95t:3 2note: _expanded from macro 'IMPL_COLL_FUNC't ) | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391R:u95n:W onote: rexpanded from macro 'IMPL_COLL_FUNC'k #,d eNvCrCeLd_oApLa,l gNoC,C LN_CACLLG_OP_R#O#TaOl_g#o#,p rNoCtCoL>_(P)R.OrTuOn_(#&#npcrcoltSoh>m(e)m..rwuonr(k&)n;c c\l S h| m ^e m.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):;562 :\15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~o up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i60d:( tnote: ifield 'group' will be initialized after field 'stepSize'd ), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~r oup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563563 | | sstteeppSSiizzee((nnccccllSShhmmeemm..ccoommmm..bbuuffffSSiizzeess[[NNCCCCLL__PPRROOTTOO__SSIIMMPPLLEE]]//NNCCCCLL__SSTTEEPPSS//ssiizzeeooff((TT)))) {{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::666677::911:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | 677 | p r i mpsr(itmisd(,t indT-htriedaSdtsaGrattBhcears,t ,d inrTehcrte-a>duspB,c aNsUtL,L ,& dairrgesc-t>-s>eonudtb,u fdfi,r eacrtg-s>-d>orwenc,v baurfgfs,- > s| e ^n dbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a202r:g53s:- >note: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree cvbu f202f | , | ^ RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:E202l:e53m:e nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here< Fn, T202, | R e d O p , ARlugnoW,o rPkrEolteom>e(n)t.1(:) .note: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereu n(we )7; | I M| P ^L _COLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppU:N7C:(1A:l lnote: Rin instantiation of member function 'RunWork, 2, 2>::run' requested heree duce, 7C | OILMLPNLE_TC_ODLILR_EFCUTN,C (SAIlMlPRLeEd,u cSeu,m PCoOsLtLDNiEvT,_ DuIiRnEtC3T2,_ tS)I M P| L^E , Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:P391o:s95t:D inote: vexpanded from macro 'IMPL_COLL_FUNC', uint3 2391_ | t ) R u| n^W orkc,, NtCyCpLe_,A LFGuOn_c####adlegvor,e dNoCpCT,O _N#C#CpLr_oAtLoG>O(_)#.#raulng(o&,n cNcClCSLh_mPeRmO.TwOo_r#k#)p;r o\t o >| ( ^) .run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:&562n:c15c:l Snote: hfield 'nthreads' will be initialized after field 'tidInBlock'm em.wo r562k | ) ; \ t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's (nthr ea562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~t hre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562I:d60x:. xnote: )field 'group' will be initialized after field 'stepSize', grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~d ), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: (field 'group' will be initialized after field 'stepSize'n threa d562s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~( threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid(tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( group )563, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ s t| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p Size (563n | c c l S hsmteemp.Sciozmem(.nbcucflfSShimzeems.[cNoCmCmL._bPuRfOfTSOi_zSeIsM[PNLCEC]L/_NPCRCOLT_OS_TSEIPMSP/LsEi]z/eNoCfC(LT_)S)T E{P S /| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i z e| o group(groupf (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h group(group: 666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :666626 | : 9 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pri m626s | ( t i d , n T hprreiamdss(Gtaitdh-etri,d SdtiarretcStc-a>tutpe,r ,N UnLTLh,r eaardgssS-c>astetnedrb,u fNfU,L La,r gdsi-r>ercetc-v>buupf,f ,a r g| s ^- >sendbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,202 :a53r:g snote: -in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here> recv b202u | f f , | ^ RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:E202l:ement53<:F nnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here T, R202e | d O p , A l g oR,u nPWroortkoE>l(e)m.ernutn<(Fwne,) ;T , | R ^e dOp, A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppl:g6o:,1 :P rnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested heret o>() .6r | uInM(PwLe_)C;O L L| _ ^F UNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppl:R8e:d1u:c enote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here COLL N8E | TI_MDPILR_ECCOTL,L _SFIUMNPCL(EA,l lSRuemdPuocset,D iCvO,L LiNnEtT3_2D_ItR)E C T| ,^ SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:,391 :S95u:m Pnote: oexpanded from macro 'IMPL_COLL_FUNC's tDiv, i391n | t 6 4R_utn)W o r| k^< nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hF:u391n:c95#:# fnote: uexpanded from macro 'IMPL_COLL_FUNC'n c, typ e391, | F uRnucn#W#odrekvf,u nNcC,C Lt_yApLeG,O _F#u#nacl#g#od,e vNrCeCdLo_pP#,# pNrCoCtLo_>A(L)G.Or_u#n#(a&lngcoc,l SNhCmCeLm_.PwRoOrTkO)_;# #\p r o| t ^o >().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:&562n:c15c:l Snote: hfield 'nthreads' will be initialized after field 'tidInBlock'm em.wo r562k | ) ; \ t i| d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^(: t562i:d15):, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d :s56215( | :n t h note: r efield 'nthreads' will be initialized after field 'tidInBlock'ta idds()t,i d562t) | i, d I nn tB hlrotecaikd(dst((htnritedhar)deI,ad dxns.t)xh,) ,tr igerdoaIundpBs(l(gorcnoktu(hptr)he,ra e da| sd ^~~~~~~~~~~~~~~~~)I ,d /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hxt:.562ix:d)60I,:n Bgnote: rlfield 'group' will be initialized after field 'stepSize'oo ucpk( (g562rt | oh ur p e) a,td iI d| d( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~tx i. dx| )) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), , n gt563rh | or ue pa (d gss(rtnoetuphpSr)iez,ae d(s n)| c, ^~~~~~~~~~~~~~~~~c ltSihdm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hIe:nm562B.:lc60oo:cm kmnote: (.field 'group' will be initialized after field 'stepSize'tb hurfefaSd iIz562de | xs. [x N) C, C Ltg_irPdoRu(OptT(iOgd_r)SoI,uM pP)nL,tE h] r/| Ne ^~~~~~~~~~~Ca CdLs_(SnTtEhPrSe/asdisz)e,o ft(iTd)I)n B{l o c| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t h| r group(groupe adIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677g:r11o:u pnote: (in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg roup), 677 | | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L15E:, warning: Sinitializer order does not match the declaration order [-Wreorder-ctor]u mPostDiv, i562n | t 3 2 _ tt)i d (| t^i d), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: (expanded from macro 'IMPL_COLL_FUNC'n threads )391, | t iRduInnWBolrokc ,563 | N C C L _sAtLeGpOS_i#z#ea(lngcoc,l SNhCmCeLm_.PcRoOmTmO._b#u#fpfrSoitzoe>s([)N.CrCuLn_(P&RnOcTcOl_SShImMePmL.Ew]o/rNkC)C;L _\S T E| P ^S /sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: note: | field 'nthreads' will be initialized after field 'tidInBlock' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d655(:t11i:d )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nthrea d655s | ( n t h r e a d s ) ,p rtiimdsI(ntBildo-ctki(dtShtraeratdRIeddxu.cxe),, ngTrhoruepa(dgsrRoeudpu)c,e , | n ^~~~~~~~~~~~~~~~~u llp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:r562,: 60&:d inote: rfield 'group' will be initialized after field 'stepSize'e ct-> o562u | t , tairdg(st-i>ds)e,n dnbtuhfrfe,a dasr(gnst-h>rrecvbufefa,d s )| , ^ tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c202k:(53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea dIdx .202x | ) , g r o u p (RgurnoWuopr)k,E l e| m ^~~~~~~~~~~e nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h RunW:o562r:k15E:l ewarning: minitializer order does not match the declaration order [-Wreorder-ctor]e nt (n)t.hrruena(dwse()n;t h r| e ^a ds),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp :t7i:d1I:n Bnote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereo ck( t7h | rIeMaPdLI_dCxO.LxL)_,F UgNrCo(uApl(lgRreoduupc)e,, C| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L L N| E tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T _DIR E563C | T , S IsMtPeLpES,i zSeu(mnPcocsltSDhimve,m .ucionmtm3.2b_utf)f S i| z^e s[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391P:R95O:T Onote: _expanded from macro 'IMPL_COLL_FUNC'S IMPLE] /391N | C C LR_uSnTWEoPrSk/, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here e>, N687C | C L _ A L G O _ # # aplrgiom,s (NtCiCdL-_tPiRdOSTtOa_r#t#Bpcraostto,> (n)T.hrruena(d&snBcccalsSth,m e&md.iwroerckt)-;> o\u t ,| ^n ullpt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:,562 :a15r:g snote: -field 'nthreads' will be initialized after field 'tidInBlock'> sendb u562f | f , a rgs-t>irde(ctvibdu)f,f ,n t h| r ^e ads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret idIn B202l | o c k ( t h r e aRduIndWxo.rxk)E,l egmreonutp<(Fgnr,o uTp,) ,R e d| O ^~~~~~~~~~~~~~~~~p , Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:o562,: 60P:r onote: tfield 'group' will be initialized after field 'stepSize'o >().r u562n | ( w e ) ;t i d| ( ^t id), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppt:h6r:e1a:d snote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heren thre a6d | sI)M,P Lt_iCdOILnLB_lFoUcNkC((tAhlrleRaeddIudcxe.,x )C,O LgLrNoEuTp_(DgIrRoEuCpT),, S I| M ^~~~~~~~~~~P LE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :S562u:m15P:o swarning: tinitializer order does not match the declaration order [-Wreorder-ctor]D iv, int 65624 | _ t ) t| i^d (tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391n:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd s(nthrea d391s | ) , RtuindWIonrBkl, 563N | C C L _ AsLtGeOp_S#i#zael(gnoc,c lNSChCmLe_mP.RcOoTmOm_.#b#upfrfoStioz>e(s)[.NrCuCnL(_&PnRcOcTlOS_hSmIeMmP.LwEo]r/kN)C;C L\_ S T| E ^P S/sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:(562T:)15): {note: field 'nthreads' will be initialized after field 'tidInBlock' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i687d:(11t: inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , nth r687e | a d s ( n t h r e a dpsr)i,m st(itdiIdn-BtliodcSkt(atrhtrBecaadsItd,x .nxT)h,r egardosuBpc(agsrto,u p&)d,i r e| c ^~~~~~~~~~~~~~~~~t ->out/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:u60l:l pnote: tfield 'group' will be initialized after field 'stepSize'r , args- >562s | e n d b utfifd,( tairdg)s,- >nrtehcrvebaudfsf(,n t h| r ^e ads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:I53n:B lnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec k(th r202e | a d I d x . x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~ T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: pSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNETf, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthre_aDdsI(RnEtChTr,e aSdIsM)P,L Et,i dSIunmBPloosctkD(itvh,r eiandtI6d4x_.tx)) , | g^r oup(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | expanded from macro 'IMPL_COLL_FUNC' tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 391 | RsutneWpoSrikzP,L EN]C/CNLC_CALL_GSOT_E#P#Sa/lsgioz,e oNfC(CTL)_)P R{O T O| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~# # p| r group(groupo to>().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h&:n655c:c11l:S hnote: min instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.work) ;655 | \ | ^ pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:m562s:(15t:i dnote: -field 'nthreads' will be initialized after field 'tidInBlock't idSta r562t | R e d u ctei,d (ntTihdr)e,a dnstRherdeuacdes,( nntuhlrlepatdrs,) ,& dtiirdeIcntB-l>oocukt(,t harregasd-I>dsxe.nxd)b,u fgfr,o uapr(ggsr-o>urpe)c,v b u| f ^~~~~~~~~~~~~~~~~f , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56253 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid(t i202d | ) , n t h r e aRdusn(WnotrhkrEelaedmse)n,t r(o)u.pr(ugnr(owuep));, | | ^ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562-:t15i:d Swarning: tinitializer order does not match the declaration order [-Wreorder-ctor]a rtReduce, n T562h | r e a d stRiedd(utcied,) ,d inrtehcrte-a>ddso(wnnt,h r&edaidrse)c,t -t>ioduItn,B laorcgks(-t>hsreenaddbIudfxf.,x )a,r ggsr-o>urpe(cgvrbouufpf),, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :563 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here s202t | e p S i z e ( n cRculnSWhomrekmE.lceommemn.tb](/)N.CrCuLn_(SwTeE)P;S / s| i ^z eof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppT:)7): 1{: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::15562:: 15note: :field 'nthreads' will be initialized after field 'tidInBlock' warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 60 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: field 'group' will be initialized after field 'stepSize' 563 | 562 | s t e ptSiidz(et(indc)c,l Snhtmherme.acdosm(mn.tbhurfefaSdisz)e,s [tNiCdCILn_BPlRoOcTkO(_tShIrMePaLdEI]d/xN.CxC)L,_ SgTrEoPuSp/(sgirzoeuopf)(,T ) )| ^~~~~~~~~~~{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h Ru:n562W:o15r:k Ewarning: linitializer order does not match the declaration order [-Wreorder-ctor]e ment (n)t.hrruena(dwse()n;t h r| e ^a ds), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppI:n7B:l1o:c knote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heret hrea d7I | dIxM.PxL)_,C OgLrLo_uFpU(NgCr(oAulpl)R,e d u| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e , | C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O LLNET _563D | I R E C Ts,t eSpISMiPzLeE(,n cSculmSPhomsetmD.icvo,m mu.ibnutf3f2S_itz)e s [| N^C CL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:O391T:O95_:S Inote: Mexpanded from macro 'IMPL_COLL_FUNC'P LE]/NC C391L | _ S TREuPnSW/osrikzin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, NCCL_ A677L | G O _ # # a l g o , pNrCiCmLs_(PtRiOdT-Ot_i#d#SptraorttoB>c(a)s.tr,u nn(T&hnrcecaldSshBmceams.tw,o r&kd)i;r e\c t -| > ^o ut, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:i562r:e15c:t -note: >field 'nthreads' will be initialized after field 'tidInBlock'd own, a562r | g s - > steindd(btuifdf),, anrtghsr-e>ardesc(vnbtuhfrfe,a d s| ) ^, tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:o202c:k53(:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree adId x202. | x ) , g r o u pR(ugnrWoourpk)E,l e m| e ^~~~~~~~~~~~~~~~~n ti(d)).,r unnt(hwree)a;d s (| n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpps:)8,: 1t:i dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested heren Bloc k8( | tIhMrPeLa_dCIOdLxL._xF)U,N Cg(rAolulpR(egdruocuep,) ,C O L| L ^~~~~~~~~~~N ET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC C562L | _ A L G Ot_i#d#(atligdo),, NnCtChLr_ePaRdOsT(On_t#h#rperaodtso)>,( )t.irduInn(B&lnoccckl(Sthhmreema.dwIodrxk.)x;) ,\ g r| o ^u p(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 15 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: field 'nthreads' will be initialized after field 'tidInBlock' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t isdt(etpiSdi)z,e (nntchcrleSahdmse(mn.tchormema.dbsu)f,f StiizdeIsn[BNlCoCcLk_(PtRhOrTeOa_dSIIdMxP.LxE)],/ NgCrCoLu_pS(TgErPoSu/ps)i,z e o| f ^~~~~~~~~~~~~~~~~( T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'group' will be initialized after field 'stepSize' group(group 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:(666t:i9d:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren thre a666d | s ( n t h r e a dpsr)i,m st(itdiIdn,B lnoTchkr(etahdrseGaadtIhdexr.,x )d,i rgercotu-p>(ugpr,o uNpU)L,L , | a ^~~~~~~~~~~r gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :563 | note: field 'nthreads' will be initialized after field 'tidInBlock' ste p562S | i z e ( ntcicdl(Sthimde)m,. cnotmhmr.ebaudfsf(Snitzherse[aNdCsC)L,_ PtRiOdTIOn_BSlIoMcPkL(Et]h/rNeCaCdLI_dSxT.ExP)S,/ sgirzoeuopf((gTr)o)u p{) , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :677:11: 562note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid (677t | i d ) , n t h r e apdrsi(mnst(htrieda-dtsi)d,S ttaidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rtBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)15,: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ endbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx908. 43 warnings generated when compiling for gfx940. 43 warnings generated when compiling for gfx941. 43 warnings generated when compiling for gfx90a. 43 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx1101. 43 warnings generated when compiling for gfx1100. 43 warnings generated when compiling for gfx803. 43 warnings generated when compiling for gfx906. 43 warnings generated when compiling for gfx900. 43 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid),7 warnings generated when compiling for gfx900. nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tidIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.coIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cppm:.1b: uIn file included from f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:S10i: zIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hs:[167N: C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Twarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ SIMPLE]/NC C562L | _ S T E PtSi/ds(itziedo)f,( Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n t| h group(groupr eads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i916d:I7n:B lnote: oin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested herec k(th r916e | a d I d x . xp)r,i mgsr(ogurpo(ugprToiudp,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p N t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads, 563& | r e c v ,s t&espeSnidz,e (anrcgcsl-S>hsmeenmd.bcuofmfm,. baurfgfsS-i>zreesc[vNbCuCfLf_,P R O| T ^O _SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:]202/:N53C:C Lnote: _in instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereS TEPS /202s | i z e o f ( T ) )R u{n W o| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k E l| e group(groupm ent, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested hereo to>() .916r | u n ( w e ) ;p r i| m ^s (groupT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cppi:d6,: 1g:r onote: uin instantiation of member function 'RunWork, 3, 2>::run' requested herep Nthre a6d | sI,M P&Lr_eCcOvL,L _&FsUeNnCd(,A lalrRgesd-u>csee,n dCbOuLfLfN,E Ta_rCgHsA-I>Nr,e cSvIbMuPfLfE,, S| u ^m PostDiv,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :i202n:t533:2 _note: tin instantiation of member function 'RunWorkElement, 3, 2>::run' requested here) | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWorkE l391e | m e nRtu, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t dat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ a1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h2:,514 :f9l:a gwarning: 2variable 'offset' set but not used [-Wunused-but-set-variable]; | ^~~~~ 514 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153 : 21i:n twarning: unused variable 'flag1' [-Wunused-variable]o ffs e153t | = t iudi;n t 3| 2 ^_ t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :warning: 562initializer order does not match the declaration order [-Wreorder-ctor]: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t esptSeipzSei(znec(cnlcSchlmSehmm.ecmo.mcmo.mbmu.fbfuSfifzSeisz[eNsC[CNLC_CPLR_OPTROO_TSOI_MSPILMEP]L/EN]C/CNLC_CSLT_ESPTSE/PsSi/zseiozfe(oTf)()T ){) {| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h::734:: 7note: :in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | 34 | p r ipmrsi(mtsi(dt,i dn,t hnrtehardesa,d s&,r i&nrgi-n>gp-r>epvr,e v&,r i&nrgi-n>gn-e>xnte,x ta,r gasr-g>ss-e>nsdebnudfbfu,f fa,r gasr-g>sr-e>crvebcuvfbfu,f fa,r gasr-g>sr-e>drOepdAOrpgA,r g0,, 0a,r gasr-g>sc-o>ncnoInnndIenxd,e xa,r gasr-g>sc-o>ncnoInnndIenxd)e;x ) ;| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h80::805::5 :note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herenote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | 80 | r urnuRniRnigno(>a(ragrsg)s;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202202::5353:: note: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested herein instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202202 | | RRuunnWWoorrkkEElleemmeenntt<>(())..rruunn((wwee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp4::61::1 :note: in instantiation of member function 'RunWork, 1, 2>::run' requested herenote: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 6I | MIPMLP_LC_OCLOLL_LF_UFNUCN(CR(eRdeudcuec,e ,R IRNIGN,G ,S ISMIPMLPEL,E ,S uSmu,m ,i nitn8t_3t2)_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::391 :note: 95expanded from macro 'IMPL_COLL_FUNC': note: expanded from macro 'IMPL_COLL_FUNC' 391 | R391u | n W oRruknt,y pNeC>C,L _NACLCGLO__A#L#GaOl_g#o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ dx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh), | ^~~~~~~~~~~ mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uin/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht32_t): 562 :| 15^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _P R| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T O_##p r563o | t o > ( )s.treupnS(i&znec(cnlcSchlmSehmm.ewmo.rcko)m;m .\b u f| f ^S izes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Tnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'_ SIMPL E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:e34a:d7I:d xnote: .in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex ), grou p34( | g r o u p ) ,p r i| m ^~~~~~~~~~~~~~~~~s (tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s, &r i562n | g - > p rteivd,( t&irdi)n,g -n>tnherxeta,d sa(rngtsh-r>esaednsd)b,u ftfi,d IanrBglso-c>kr(etchvrbeuafdfI,d xa.rxg)s,- >grreoduOpp(Agrrgo,u p0),, a r| g ^~~~~~~~~~~s ->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h; : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 562 | 6 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eRaeddsu(cnet,h rReIaNdGs,) ,S ItMiPdLIEn,B lSoucmk,( tihnrte3a2d_Itd)x . x| )^, gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p391(:g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC') , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 391| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RunW o563r | k < n c csltFeupnSci#z#ef(unnccc,l Sthympeem,. cFoumnmc.#b#udfefvSriezdeosp[P,R ONTCOC_LS_IAMLPGLOE_]#/#NaClCgLo_,S TNECPCSL/_sPiRzOeToOf_(#T#)p)r o{t o >| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) . r| u group(groupn (&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hm:.34w:o7r:k )note: ;in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here \ | ^ 34 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 :p rnote: ifield 'nthreads' will be initialized after field 'tidInBlock'm s(ti d562, | n t h rteiadd(st,i d&)r,i nngt-h>rperaedvs,( n&trhirnega-d>sn)e,x tt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o uapr)g,s - >| r ^~~~~~~~~~~~~~~~~e dOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:r562g:,60 :0 ,note: field 'group' will be initialized after field 'stepSize'a rgs- >562c | o n n I ntdiedx(,t iadr)g,s -n>tchornenaIdnsd(enxt)h;r e a| d ^s ), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:I80n:B5l:o cnote: kin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here( thr e80a | d I d x .rxu)n,R ignrgo(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork8,_ tN)C C L| _^A LGO_##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:,391 :N95C:C Lnote: _expanded from macro 'IMPL_COLL_FUNC'P ROTO_## p391r | o t oR>u(n)W.orrukn<(n&cncclcFluSnhcm#e#mf.uwnocr,k )t;y p\e , | F ^u nc##d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:v562r:e15d:o pnote: , 562N | C C L _ AtLiGdO(_t#i#da)l,g on,t hNrCeCaLd_sP(RnOtThOr_e#a#dpsr)o,t ot>i(d)I.nrBulno(c&kn(ctchlrSehamdeImd.xw.oxr)k,) ;g r\o u p| ( ^g roup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~562 :15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 60: note: field 'group' will be initialized after field 'stepSize' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~562 :60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T15E:P Swarning: /initializer order does not match the declaration order [-Wreorder-ctor]s izeof(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ha:d34s:)7,: tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered InBlock(t h34r | e a d I d x .pxr)i,m sg(rtoiudp,( gnrtohurpe)a,d s ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~& r i| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g ->p r563e | v , & rsitnegp-S>inzeex(tn,c calrSghsm-e>ms.ecnodmbmu.fbfu,f faSrigzse-s>[rNeCcCvLb_uPfRfO,T Oa_rSgIsM-P>LrEe]d/ONpCACrLg_,S T0E,P Sa/rsgisz-e>ocfo(nTn)I)n d{e x ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a r g| s group(group- >connInd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.he:x34):;7 : | note: ^in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :3480 | : 5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here prim s80( | t i d , rnutnhRrienagds, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE, S:u562m:,15 :u iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t 32_t) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nth r391e | a d sR(unntWhorreka , | N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C C L| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)A LGO_# #563a | l g o , sNtCeCpLS_iPzReO(TnOc_c#l#Sphrmoteom>.(c)o.mrmu.nb(u&fnfcScilzSehsm[eNmC.CwLo_rPkR)O;T O\_ S I| M ^P LE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T15E:P Snote: /field 'nthreads' will be initialized after field 'tidInBlock's izeof (562T | ) ) { t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t i| d group(group) , nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ht:h34r:e7a:d snote: )in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, tidInB l34o | c k ( t h r epardiImdsx(.txi)d,, gnrtohurpe(agdrso,u p&)r,i n g| - ^~~~~~~~~~~~~~~~~> pre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hv:,562 :&60r:i nnote: gfield 'group' will be initialized after field 'stepSize'- >nex t562, | a r g st-i>ds(etniddb)u,f fn,t harregasd-s>(rnetchvrbeuafdfs,) ,a rtgisd-I>nrBeldoOcpkA(rtgh,r e0a,d Iadrxg.sx-)>,c ognrnoIunpd(egxr,o uapr)g,s - >| c ^~~~~~~~~~~o nnIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cppf:l1a: gIn file included from 2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h;: 10 : | In file included from ^~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h::562153::1521:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]unused variable 'flag1' [-Wunused-variable] 153 | u562i | n t 3 2 _tti dd(attiad1),, fnltahgr1e,a ddsa(tnat2h,r efaldasg)2,; t i| d ^~~~~I nBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ho:c153k:(28t:h rwarning: eunused variable 'data2' [-Wunused-variable]a dId x153. | x ) , gurionutp3(2group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S15h:m ewarning: minitializer order does not match the declaration order [-Wreorder-ctor]. work); \ 562 | | ^ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d15):, note: nfield 'nthreads' will be initialized after field 'tidInBlock't hread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d x .| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~~~~~~~( ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m60e:m .note: cfield 'group' will be initialized after field 'stepSize'o mm.buf f562S | i z e s [tNiCdC(Lt_iPdR)O,T On_tShIrMePaLdEs](/nNtChCrLe_aSdTsE)P,S /tsiidzIenoBfl(oT)c)k ({t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d I| d group(groupx .x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hu:p34(:g7r:o unote: pin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , | ^~~~~~~~~~~ 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562&:r15i:n gwarning: -initializer order does not match the declaration order [-Wreorder-ctor]> next, a562r | g s - > steindd(btuifdf),, anrtghsr-e>ardesc(vnbtuhfrfe,a dasr)g,s -t>irdeIdnOBplAorcgk,( t0h,r eaardgIsd-x>.cxo)n,n Ignrdoeuxp,( garrogusp-)>,c o n| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I n d| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x ); | 563 ^ | st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.he:p80S:i5z:e (note: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herec clSh m80e | m . c o mrmu.nbRuifnfgSS(IaMrPgLsE)];/ N C| C ^L _STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:i202z:e53o:f (note: Tin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) ) { | 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hn:W34o:r7k:E lnote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem enta(d)s.,r u&nr(iwneg)-;> p r| e ^v , &ring-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp>:n12e:x1t:, note: ain instantiation of member function 'RunWork, 1, 2>::run' requested herer gs->s e12n | dIbMuPfLf_,C OaLrLg_sF-U>NrCe(cRvebduufcfe,, aRrIgNsG-,> rSeIdMOPpLAEr,g ,P r0o,d ,a rdgosu-b>lceo)n n I| n^d ex, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g391s:-95>:c onote: nexpanded from macro 'IMPL_COLL_FUNC'n Index); 391 | | ^ Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hW:o80r:k5<:n cnote: cin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herel Fu n80c | # # f u nrcu,n Rtiynpge<,T ,F uRnecd#O#pd,e vPrreodtoop><(tarygpse)>;, N| C ^C L_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hG:O202_:#53#:a lnote: gin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereo , N C202C | L _ P R O T O _ #R#upnrWootrok>E(l)e.mreunnt(<&Fnnc,c lTS,h mReemd.Owpo,r kA)l;g o\, P| r ^o to>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562(:w15e:) ;note: field 'nthreads' will be initialized after field 'tidInBlock' | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp562: | 7 : 1 : tnote: iin instantiation of member function 'RunWork, 1, 2>::run' requested hered (ti d7) | ,I MnPtLh_rCeOaLdLs_(FnUtNhCr(eRaeddsu)c,e ,t iRdIINnGB,l oScIkM(PtLhEr,e aPdrIoddx,. xu)i,n tg3r2o_utp)( g r| o^u p),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^~~~~~~~~~~~~~~~~95 : note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: 391note: | field 'group' will be initialized after field 'stepSize' RunWo r562k | < n c c ltFiudn(ct#i#df)u,n cn,t htryepaed,s (Fnutnhcr#e#addesv)r,e dtoipdc,k (NtChCrLe_aAdLIGdOx_.#x#)a,l ggor,o uNpC(CgLr_oPuRpO)T,O _ #| # ^~~~~~~~~~~p roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 11 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cppL:_13C:O1L:L _note: Fin instantiation of member function 'RunWork, 1, 2>::run' requested hereU NC(R e13d | uIcMeP,L _RCIONLGL,_ FSUINMCP(LREe,d uPcreo,d ,R IfNlGo,a tS)I M P| L^E , Prod, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:c391c:l95_:b fnote: lexpanded from macro 'IMPL_COLL_FUNC'o at16) | 391^ | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hW:o391r:k95<:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Func## f391u | n c ,R utnyWpoer,k e,, NFCuCnLc_#A#LdGeOv_r#e#daolpgC,L _NPCRCOLT_OA_L#G#Op_r#o#taol>g(o),. rNuCnC(L&_nPcRcOlTSOh_m#e#mp.rwootrok>)(;) .\r u n| ( ^& ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r15k:) ;note: field 'nthreads' will be initialized after field 'tidInBlock'\ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t60i:d (note: tfield 'group' will be initialized after field 'stepSize'i d), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~, group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. x )| , ^ group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp(:g7r:o1u:p )note: ,in instantiation of member function 'RunWork, 1, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 7 | | I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PL_COL L563_ | F U N C (sRteedpuSciez,e (RnIcNcGl,S hSmIeMmP.LcEo,m mP.rboudf,f Suiiznets3[2N_CtC)L _ P| R^O TO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P391L:E95]:/ Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'C L_STEPS /391s | i z eRoufn(WTo)r)k <{n c c| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F u n| c group(group# #func, ty/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hp:e34,: 7F:u nnote: cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #devre d34o | p < t y p e >p,r iNmCsC(Lt_iAdL,G On_t#h#raelagdos,, N&CrCiLn_gP-R>OpTrOe_v#,# p&rroitnog>-(>)n.erxutn,( &anrcgcsl-S>hsmeenmd.bwuofrfk,) ;a r\g s -| > ^r ecvbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:f562,: 15a:r gnote: sfield 'nthreads' will be initialized after field 'tidInBlock'- >redO p562A | r g , 0t,i da(rtgisd-)>,c onntnhIrnedaedxs,( natrhgrse-a>dcso)n,n ItniddeIxn)B;l o c| k ^( threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:I80d:x5.:x )note: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here gro u80p | ( g r o urpu)n,R i n| g ^~~~~~~~~~~~~~~~~< T, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:e562d:O60p:, note: Pfield 'group' will be initialized after field 'stepSize'r oto> (562a | r g s ) ;t i d| ( ^t id), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: (in instantiation of member function 'RunWorkElement, 1, 2>::run' requested heren thre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp ^~~~~~~~~~~: 1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)15,: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 562note: | field 'group' will be initialized after field 'stepSize' ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cppi:d1(: tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:)10,: In file included from n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ht:h167r: e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads), tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~d InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:R15i:n gwarning: ( atrigds()t;i d )| , ^ nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:(53n:t hnote: rin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heree ads )202, | t i d I n B l oRcukn(WtohrrkeEaldeImdexn.tx<)F,n ,g rTo,u pR(egdrOopu,p )A,l g o| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ P r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t o>().ru n563( | w e ) ; s t| e ^p Size(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cppn:c6c:l1S:h mnote: ein instantiation of member function 'RunWork, 1, 2>::run' requested herem .co m6m | .IbMuPfLf_SCiOzLeLs_[FNUCNCCL(_RPeRdOuTcOe_,S IRMIPNLGE,] /SNICMCPLL_ES,T EMPiSn/,s iizneto3f2(_Tt))) {| ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : group(group391 :95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34 :3917 | : note: Rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu nWorkyppree>v,, N&CrCiLn_gA-L>GnOe_x#t#,a lagrog,s -N>CsCeLn_dPbRuOfTfO,_ #a#rpgrso-t>or>e(c)v.bruufnf(,& nacrcglsS-h>mreemd.OwpoArrkg),; 0\, a| r ^g s->co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:n562I:n15d:e xnote: ,field 'nthreads' will be initialized after field 'tidInBlock' args- >562c | o n n I ntdiedx()t;i d )| , ^ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.he:a80d:s5(:n tnote: hin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer ead s80) | , t i drIunnBRlioncgk<(Tt,h rReeaddOIpd,x .Pxr)o,t og>r(oaurpg(sg)r;o u p| ) ^, | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here: 60: note: 202field 'group' will be initialized after field 'stepSize' | 562 | R u n W otrikdE(lteimde)n,t l(o)c.kr(utnh(rweea)d;I d x| . ^x ), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cppu:p8(:g1r:o unote: pin instantiation of member function 'RunWork, 1, 2>::run' requested here) , | ^~~~~~~~~~~8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s (nthr e563a | d s ) , sttiedpISniBzleo(cnkc(ctlhSrhemaedmI.dcxo.mxm).,b ugfrfoSuipz(egsr[oNuCpC)L,_ P R| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T O _| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I MPLE] /563N | C C L _ SsTtEePpSS/isziez(enocfc(lTS)h)m e{m . c| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m m .| b group(groupu ffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hN:C34C:L7_:P Rnote: Oin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_SIMPL E34] | / N C C L _ SpTrEiPmSs/(stiizde,o fn(tTh)r)e a{d s ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~& r i| n group(groupg ->prev,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :&34r:i7n:g -note: >in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren ext, ar g34s | - > s e n d bpurfifm,s (atrigds,- >nrtehcrvebaudfsf,, &arrignsg-->>rperdeOvp,A r&gr,i n0g,- >anregxst-,> caorngnsI-n>dseexn,d baurfgfs,- >acrognsn-I>nrdeecxv)b;u f f| , ^ args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:e80d:O5p:A rnote: gin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here, 0, a80r | g s - > crounnnRIinndge,c oPnrnoItnod>e(xa)r;g s )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here202 :53: 80note: | in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here ru n202R | i n g < T , R eRduOnpW,o rPkrEolteom>e(natr().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:o15f:( Twarning: )initializer order does not match the declaration order [-Wreorder-ctor]) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 562 group(group | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:d34):,7 :n tnote: hin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads(nt h34r | e a d s ) , ptriidmIsn(Btliodc,k (ntthhrreeaaddIsd,x .&xr)i,n gg-r>opurpe(vg,r o&urpi)n,g - >| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e x t| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) args- >563s | e n d b usftfe,p Sairzges(-n>crcelcSvhbmuefmf.,c oamrmg.sb-u>frfeSdiOzpeAsr[gN,C C0L,_ PaRrOgTsO-_>ScIoMnPnLIEn]d/eNxC,C La_rSgTsE-P>Sc/osninzIenodfe(xT));) {| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562<:n15c:c lwarning: Finitializer order does not match the declaration order [-Wreorder-ctor]u nc##func, t562y | p e , Ftuindc(#t#idde)v,r endtohprn,t hNrCeCaLd_sA)L,G Ot_i#d#IanlBgloo,c kN(CtChLr_ePaRdOITdOx_.#x#)p,r ogtroo>u(p)(.grruonu(p&)n,c c l| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h m e| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). work) ;563 | \ | ^s tepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:(15n:c cnote: lfield 'nthreads' will be initialized after field 'tidInBlock'S hmem .562c | o m m . btuifdf(Stiizde)s,[ NnCtChLr_ePaRdOsT(On_tShIrMePaLdEs])/, NtCiCdLI_nSBTlEoPcSk/(stihzreeoafd(ITd)x). x{) , | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group( group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::655562::1160:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herefield 'group' will be initialized after field 'stepSize' 562655 | | t i d ( t i dp)r,i mnst(htrieda-dtsi(dnSttharretaRdesd)u,c et,i dnITnhBrleoacdks(Rtehdruecaed,I dnxu.lxl)p,t rg,r o&udpi(rgercotu-p>), | ^~~~~~~~~~~ out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d ), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,t iNdC(CtLi_dA)L,G On_t#h#raelagdos,( nNtChCrLe_aPdRsO)T,O _t#i#dpIrnoBtloo>c(k)(.trhurne(a&dnIcdcxl.Sxh)m,e mg.rwoourpk()g;r o\u p )| , ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:15: 563note: | field 'nthreads' will be initialized after field 'tidInBlock' st e562p | S i z e (tnicdc(ltSihdm)e,m .nctohmrme.abdusf(fnStihzreesa[dNsC)C,L _tPiRdOITnOB_lSoIcMkP(LtEh]r/eNaCdCILd_xS.TxE)P,S /gsriozuepo(fg(rTo)u)p ){, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562: | 655 : 11 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid) ,655 | n t h r e a d s ( n tphrriemasd(st)i,d -ttiiddISntBalrotcRke(dtuhcree,a dnITdhxr.exa)d,s Rgerdouucpe(,g rnouulpl)p,t r ,| ^~~~~~~~~~~& direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 4 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eAaldlsR(endtuhcree,a dCsO)L,L NtEiTd_IDnIBRlEoCcTk,( tShIrMePaLdEI,d xS.uxm),, ignrto8u_pt()g r o| u^p ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~391 : 95| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: expanded from macro 'IMPL_COLL_FUNC' 563 | 391s | t e pRSuinzWeo(rnkcN,C CNLC_CSLT_EAPLSG/Os_i#z#eaolfg(oT,) )N C{C L _| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R O T| O group(group_ ##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:(666&:n9c:c lnote: Sin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh mem.wo r666k | ) ; \ | ^ prims(tid, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:T562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock'G ather ,562 | d i r e ctti-d>(utpi,d )N,U LnLt,h raeragdss-(>nstehnrdebaudfsf),, atrigdsI-n>Brleoccvkb(utfhfr,e a d| I ^d x.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:( gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up), 202 | | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k Eleme n562t | < F n , tTi,d (RteiddO)p,, nAtlhgroe,a dPsr(onttoh>r(e)a.drsu)n,( wtei)d;I n B| l ^o ck(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:e5a:d1I:d xnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herex ), g r5o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~C (AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:E562]:/15N:C Cwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ STEPS/sizeof (562T | ) ) { t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t i| d group(group) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnthre:a562d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~s [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~. x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:E15l:e mwarning: einitializer order does not match the declaration order [-Wreorder-ctor]n tr(e)a.drsu(nn(twher)e;a d s| ) ^, tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppk:(6t:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested hereI dx.x) ,6 | gIrMoPuLp_(CgOrLoLu_pF)U,N C (| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l l R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d uce, 563C | O L L N EsTt_eDpISRiEzCeT(,n cScIlMSPhLmEe,m .Scuomm,m .ibnutf3f2S_itz)e s [| N^C CL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T391O:_95S:I Mnote: Pexpanded from macro 'IMPL_COLL_FUNC'L E]/NCCL _391S | T E PRSu/nsWiozreko, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer edop< t655y | p e > , N C C L _ ApLrGiOm_s#(#taildg-ot,i dNSCtCaLr_tPRReOdTuOc_e#,# pnrTohtroe>a(d)s.Rreudnu(c&en,c cnluSlhlmpetmr.,w o&rdki)r;e c\t - >| o ^u t, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:s15e:n dnote: bfield 'nthreads' will be initialized after field 'tidInBlock'u ff, a r562g | s - > r etcivdb(utfifd,) , | n ^t hreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ), t202i | d I n B l o c k (RtuhnrWeoardkIEdlxe.mxe)n,t :( )note: .field 'group' will be initialized after field 'stepSize'r un(we) ;562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppi:d5):,1 :n tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eads (5n | tIhMrPeLa_dCsO)L,L _tFiUdNICn(BAllolcRke(dtuhcree,a dCIOdLxL.NxE)T,_ DgIrRoEuCpT(,g rSoIuMpP)L,E , | S ^~~~~~~~~~~u m, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]n Block(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~z es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::1595:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]expanded from macro 'IMPL_COLL_FUNC' 391 | 562R | u n W o rtkie,a dNICdCxL._xA)L,G Og_r#o#uapl(ggor,o uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O T O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #prot o563> | ( ) . r usnt(e&pnSciczleS(hnmcecml.Swhomrekm).;c o\m m .| b ^u ffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:[562N:C15C:L _note: Pfield 'nthreads' will be initialized after field 'tidInBlock'R OTO_SI M562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP | :L 562E :] 15/ :Nt Ciwarning: Cdinitializer order does not match the declaration order [-Wreorder-ctor]L( _tSiTdE) P,562S | /n st ih zr eetoaifdd(s(T(t)ni)td h){r, e an| dt ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sh )r ,e| a group(grouptd isd(InntBhlroecakd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(s:t)641h,:r 11et:ai ddnote: IIin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heredn xB.lxo)c, k 641(g | tr ho ru ep a( dg Ir do xu .p x)p),r, i mg| sr ^~~~~~~~~~~~~~~~~(o tui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hpd:(-562gt:ri60od:uS ptnote: )afield 'group' will be initialized after field 'stepSize',r t R| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562d | u c| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) , tniT dh563(r | te ia dd )s ,Rs etndetuphcSreie,za edd(sin(rcnectclhtSr-he>madedomsw.)nc,,o mt&mid.dibIrunefBcfltSo-ic>zke(st[hNrCeCaLd_IPdRxO.TxO)_,S IgMrPoLuEp](/gNrCoCuLp_)S,T E P| S ^~~~~~~~~~~/ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h), gr:o562u:p15(:g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:S15t:a rwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]B cast, nThr e562a | d s B c atsitd,( t&iddi)r,e cntt-h>roeuatd,s (dnitrherceta-d>sd)o,w nt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o u p| ) ^, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53 :563 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here st e202p | S i z e ( n c c lRSuhnmWeomr.kcEolmemm.ebnutfN(C)C.Lr_uSnT(EwPeS)/;s i z| e ^o f(T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 5| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~1 : | note: group(groupin instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:L687_:C11O:L note: Lin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ FUNC(A l687l | R e d u c e , C O LpLrNiEmTs_(DtIiRdE-CtTi,d SStIaMrPtLBEc,a sStu,m ,n Tuhirneta8d_stB)c a s| t^, &di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391c:t95-:> onote: uexpanded from macro 'IMPL_COLL_FUNC't , null p391t | r , RaurngWso-r>ks,r etcyvpbeu,f fF,u n c| # ^# devredo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:<202t:y53p:e >note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here NCC L202_ | A L G O _ # # a lRguon,W oNrCkCELl_ePmReOnTtO<_F#n#,p rTo,t oR>e(d)O.pr,u nA(l&gnoc,c lPSrhomteom>.(w)o.rrku)n;( w\e ) ;| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp::154:: 1note: :field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 4 | I M PtLi_dC(OtLiLd_)F,U NnCt(hArlelaRdesd(uncteh,r eCaOdLsL)N,E Tt_iDdIIRnEBClTo,c kS(ItMhPrLeEa,d ISduxm.,x )i,n tg8r_otu)p ( g| r^o up),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^~~~~~~~~~~~~~~~~95 : note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hexpanded from macro 'IMPL_COLL_FUNC': 562:60: note: field 'group' will be initialized after field 'stepSize'391 | R u562n | W o r k h,r eNaCdCILd_xA.LxG)O,_ #g#raolugpo(,g rNoCuCpL)_,P R O| T ^~~~~~~~~~~O _##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d InBlock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s (nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~k (th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60I:d xnote: .field 'group' will be initialized after field 'stepSize'x ), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~_ SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:p15r:o twarning: oinitializer order does not match the declaration order [-Wreorder-ctor]> ().run( &562n | c c l S htmiedm(.twiodr)k,) ;n t\h r e| a ^d s(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,note: field 'nthreads' will be initialized after field 'tidInBlock't idInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . b| u ^~~~~~~~~~~~~~~~~f fSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:[562N:C60C:L _note: Pfield 'group' will be initialized after field 'stepSize'R OTO_SI M562P | L E ] / NtCiCdL(_tSiTd)E,P Sn/tshirzeeaodfs((Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~), ti d I| n group(groupB lock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I687d:x11.:x )note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here group (687g | r o u p ) , | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d15):, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads(nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I dx.x) ,563 | g r o u ps(tgerpoSuipz)e,( n c| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l S h| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e m.com m563. | b u f f SsitzeepsS[iNzCeC(Ln_cPcRlOSThOm_eSmI.McPoLmEm]./bNuCfCfLS_iSzTeEsP[SN/CsCiLz_ePoRfO(TTO)_)S I{M P L| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~] / N| C group(groupC L_STEPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:o641f:(11T:) )note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~641 | | group(group pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:m641s:(11t:i dnote: -in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret idStart R641e | d u c e , n T h r epardismRse(dtuicde-,t iddiSrteacrtt-R>eddouwcne,, &ndTihrreecatd-s>Roeudtu,c ea,r gdsi-r>escetn-d>bduofwfn,, a&rdgisr-e>crte-c>vobuutf,f ,a r g| s ^- >send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hb:u202f:f53,: anote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg s-> r202e | c v b u f f , R| u ^n WorkEleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t202<:F53n:, note: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, RedO p202, | A l g o , P rRoutnoW>o(r)k.Erluenm(ewnet)<;F n ,| ^T , Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppO:p6,: 1A:l gnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested here, Pro t6o | >I(M)P.Lr_uCnO(LwLe_)F;U N C| ( ^A llReduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 6C:O1L:L Nnote: Ein instantiation of member function 'RunWork, 2, 2>::run' requested hereT _DIRE C6T | ,I MSPILM_PCLOEL,L _SFuUmN,C (iAnltl3R2e_dtu)c e ,| ^C OLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:T391_:D95I:R Enote: Cexpanded from macro 'IMPL_COLL_FUNC'T , SIM P391L | E , RSuunmW,o rikn, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h; \ | : ^562 :15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hinitializer order does not match the declaration order [-Wreorder-ctor]: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 :60: note: field 'group' will be initialized after field 'stepSize'563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C15L:_ Awarning: Linitializer order does not match the declaration order [-Wreorder-ctor]G O_##algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrcecaldSsh)m,e mt.iwdoIrnkB)l;o c\k ( t| h ^r eadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 15g:r onote: ufield 'nthreads' will be initialized after field 'tidInBlock'p (group )562, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( tid) ,563 | n t h r esatdesp(Snitzher(enacdcsl)S,h mteimd.IcnoBmlmo.cbku(ftfhSriezaedsI[dNxC.CxL)_,P RgOrToOu_pS(IgMrPoLuEp])/,N C C| L ^~~~~~~~~~~~~~~~~_ STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:/562s:i60z:e onote: ffield 'group' will be initialized after field 'stepSize'( T)) { 562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | t group(groupi d(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h626r:e9a:d snote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren threa d626s | ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,S cgartotuepr(,g rnoTuhpr)e,a d s| S ^~~~~~~~~~~c atter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d ), nthreads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I d x| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x ), gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n c| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Shmem .563c | o m m . bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| S group(groupT EPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:)687): 11{: note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :p666r:i9m:s (note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d-ti d666S | t a r t B c a s tp,r inmTsh(rteiadd,s BncTahsrte,a d&sdGiartehcetr-,> oduitr,e cntu-l>luppt,r ,N UaLrLg,s -a>rsgesn-d>bsuefnfd,b uafrfg,s -a>rrgesc-v>bruefcfv,b u f| f ^, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h53::202 :note: 53in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R u n WRournkWEolrekmEelnetmo(t)o.>r(u)n.(rwuen)(;w e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp7::81:1:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 78 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, SSuumm,, uiinntt6342__tt)) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<>,, NNCCCCLL__AALLGGOO__####aallggoo,, NNCCCCLL__PPRROOTTOO__####pprroottoo>>(())..rruunn((&&nnccccllSShhmmeemm..wwoorrkk));; \\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkup, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~ c##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: pinitializer order does not match the declaration order [-Wreorder-ctor]r ims(tid- t562i | d S t a rttiBdc(atsitd,) ,n TnhtrheraedasdBsc(anstth,r e&addisr)e,c tt-i>doIuntB,l oncukl(ltphtrre,a daIrdgxs.-x>)s,e ngdrbouufpf(,g raorugps)-,> r e| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~v b u| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f , | ^ 563 | st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:p202S:i53z:e (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec clSh m202e | m . c o m m . b uRfufnSWiozreksE[lNeCmCeLn_tPz(e)o.fr(uTn)()w e{) ; | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h1::687 :note: 11in instantiation of member function 'RunWork, 2, 2>::run' requested here: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 7 | IMP L687_ | C O L L _ F U N C ( AplrliRmesd(utcied,- tCiOdLSLtNaErTt_BDcIaRsEtC,T ,n TShIrMePaLdEs,B cSausmt,, u&idnitr3e2c_tt-)> o u| t^, null/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:t391r:,95 :a rnote: gexpanded from macro 'IMPL_COLL_FUNC's ->send b391u | f f ,R uanrWgosr-k>, 2, 2>::run' requested herev redo p202< | t y p e > , N CRCuLn_WAoLrGkOE_l#e#maelngto<,F nN,C CTL,_ PRReOdTOOp_,# #Aplrgoot,o >P(r)o.trou>n(()&.nrcucnl(Swhem)e;m . w| o ^r k); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp| : ^8 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15 :8 | note: Ifield 'nthreads' will be initialized after field 'tidInBlock'M PL_CO L562L | _ F U N Ct(iAdl(ltRiedd)u,c en,t hCrOeLaLdNsE(Tn_tDhIrReEaCdTs,) ,S ItMiPdLIEn,B lSoucmk,( tihnrte6a4d_Itd)x . x| )^, grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(391g:r95o:u pnote: )expanded from macro 'IMPL_COLL_FUNC', | ^~~~~~~~~~~~~~~~~ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k i,d INnCBClLo_cAkL(GtOh_r#e#aadlIgdox,. xN)C,C Lg_rPoRuOpT(Og_r#o#uppr)o,t o >| ( ^~~~~~~~~~~) .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxz.ex()n,c cglrSohumpe(mg.rcooumpm).,b u f| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S i z| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s [NCCL_PROTO_ S563I | M P L E ]s/tNeCpCSLi_zSeT(EnPcSc/lsSihzmeeomf.(cTo)m)m .{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S i z| e group(groups [NCCL_PROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h]:/655N:C11C:L _note: Sin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT EPS/siz e655o | f ( T ) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p r i| m group(groups (tid-tidStartReduce, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r655e:a11d:s Rnote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered uce, nu l655l | p t r , & d i r e cptr-i>mosu(tt,i da-rtgisd-S>tsaerntdRbeudfufc,e ,a rngTsh-r>eraedcsvRbeudfufc,e , | n ^u llptr, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:i202r:e53c:t -note: >in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo ut, a202r | g s - > s e n d bRuufnfW,o rakrEglse-m>ernetcnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) .run (202w | e ) ; | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:k7E:l1e:m enote: nin instantiation of member function 'RunWork, 2, 2>::run' requested heret u(c)e.,r uCnO(LwLeN)E;T _ D| I ^R ECT, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppM:P6:L1E:, note: Sin instantiation of member function 'RunWork, 2, 2>::run' requested hereu m, ui n6t | 3I2M_PtL)_ C O| L^L _F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hU:N391C:(95A:l lnote: Rexpanded from macro 'IMPL_COLL_FUNC'e duce, 391C | O L LRNuEnTW_oDrIkRnote: ,expanded from macro 'IMPL_COLL_FUNC' NCCL_A L391G | O _ #R#uanlWgoor,k p(e),. rFuunn(c&#n#cdcelvSrhemdeomp.;, \N C C| L ^_ ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:l562g:o15,: Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_PROTO _562# | # p r o ttoi>d(()t.irdu)n,( &nntchcrleSahdmse(mn.twhorreka)d;s )\, t| i ^d InBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Inote: dfield 'nthreads' will be initialized after field 'tidInBlock'x .x), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~, nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s (note: nfield 'group' will be initialized after field 'stepSize't hread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~k (th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60I:d xnote: .field 'group' will be initialized after field 'stepSize'x ), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hCL_PR:O562T:O15_:# #warning: pinitializer order does not match the declaration order [-Wreorder-ctor]r oto>().run( &562n | c c l S htmiedm(.twiodr)k,) ;n t\h r e| a ^d s(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,note: field 'nthreads' will be initialized after field 'tidInBlock't idInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgn:rt562oh:ur15pe:)a ,dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor] ) | , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| 562d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | I n B l o563tc | ik d( (t th irsdet)ae,dp ISnditxzh.erx(e)na,cd csgl(rSonhutmphe(rmge.racodouspm))m,,. bt uif| d ^~~~~~~~~~~~~~~~~fIS niBzl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.heo:sc562[k:N(60Ct:Ch Lr_note: ePfield 'group' will be initialized after field 'stepSize'aRd OITdOx_. SxI562)M | ,P L gE r] o/tuNipCd(C(gLtr_ioSduT)pE,)P ,Sn /t sh| ir ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ze ea od| fs tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)(( Tn)t) h 563r{ | e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ) s, t | e group(grouptp iSdiIzneB(lnoccckl(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hSt:hh655mr:ee11ma:.d cInote: odin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heremx m..xb)u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I15M:P Lwarning: Einitializer order does not match the declaration order [-Wreorder-ctor]] /NCCL_STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:B655l:o11c:k (note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh readI d655x | . x ) , g r o u p (pgrriomusp()t,i d -| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d S| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rtRed u563c | e , n TshtreepaSdiszRee(dnucccel,S hnmuelml.pctorm,m .&bduifrfeScitz-e>so[uNtC,C La_rPgRsO-T>Os_eSnIdMbPuLfEf],/ NaCrCgLs_-S>TrEePcSv/bsuifzfe,o f (| T ^) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : group(group53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :202677 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Run W677o | r k E l e m e n t < Fpnr,i mTs,( tRiedd-Otpi,d SAtlagrot,B cParsott,o >n(T)h.rreuand(swBec)a;s t ,| ^& direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp-:>7o:u1t:, note: din instantiation of member function 'RunWork, 2, 2>::run' requested herei rect -7> | dIoMwPnL,_ CaOrLgLs_-F>UsNeCn(dAblulfRfe,d uacreg,s -C>OrLeLcNvEbTu_fDfI,R E C| T ^, SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:E202,: 53S:u mnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here uin t2023 | 2 _ t ) | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k391E:l95e:m enote: nexpanded from macro 'IMPL_COLL_FUNC't #(#)f.urnucn,( wtey)p;e , | F ^u nc##de/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppv:r8e:d1o:p , 2, 2>::run' requested herey pe>, 8N | CICMLP_LA_LCGOOL_L#_#FaUlNgCo(,A lNlCRCeLd_uPcReO,T OC_O#L#LpNrEoTt_oD>I(R)E.CrTu,n (S&InMcPcLlES,h mSeumm.,w oirnkt)6;4 _\t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::1595:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | RtuindW(otrikd<)n,c cnltFhurneca#d#sf(unntch,r etaydpse),, FtuindcI#n#Bdleovcrke(dtohprx,. xN)C,C Lg_rAoLuGpO(_g#r#oaulpg)o,, N| C ^~~~~~~~~~~~~~~~~C L_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_60#:# pnote: rfield 'group' will be initialized after field 'stepSize'o to>( )562. | r u n ( &tnicdc(ltSihdm)e,m .nwtohrrke)a;d s\( n t| h ^r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), g r563o | u p ( g rsotuepp)S,i z e| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n c c| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S hme m563. | c o m m .sbtuefpfSSiizzee(sn[cNcClCSLh_mPeRmO.TcOo_mSmI.MbPuLfEf]S/iNzCeCsL[_NSCTCELP_SP/RsOiTzOe_oSfI(MTP)L)E ]{/ N C| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L _ S| T group(groupE PS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~641 : 11| : group(group note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h641: | 655 : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here p r655i | m s ( t i d - t i d SptrairmtsR(etdiudc-et,i dnSTtharretaRdesdRuecdeu,c en,T hdrieraedcstR-e>dduocwen,, n&udlilrpetcrt,- >&oduitr,e catr-g>so-u>ts,e nadrbgusf-f>,s eanrdgbsu-f>fr,e cavrbgusf-f>,r e c| v ^b uff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h53::202 :note: 53in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 202 | : 562 : 15 : warning: Rinitializer order does not match the declaration order [-Wreorder-ctor]u nWorkElement< F562n | , T , tRiedd(Otpi,d )A,l gnot,h rPeraodtso(>n(t)h.rreuand(sw)e,) ;t i d| I ^n Block(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:e8a:d1I:d xnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herex ), g r8o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C ( A| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Redu c563e | , C O LsLtNeEpTS_iDzIeR(EnCcTc,l SShImMePmL.Ec,o mSmu.mb,u fifnSti6z4e_st[)N C C| L^_ PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:S391I:M95P:L Enote: ]expanded from macro 'IMPL_COLL_FUNC'/ NCCL_S T391E | P S /RsuinzWeoorfk(, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo p , N C C L _ ApLrGiOm_s#(#taildg,o ,n TNhCrCeLa_dPsRGOaTtOh_e#r#,p rdoitroe>c(t)-.>ruupn,( &NnUcLcLl,S hamregms.-w>osrekn)d;b u\f f ,| ^a rgs->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fnote: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:(53t:i dnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nth r202e | a d s ( n t h r eRaudnsW)o,r ktEildeImneBnltou(p)).,r u n| ( ^~~~~~~~~~~~~~~~~w e); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7: 1562: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid (7t | iIdM)P,L _nCtOhLrLe_aFdUsN(Cn(tAhlrleRaeddsu)c,e ,t iCdOILnLBNlEoTc_kD(ItRhErCeTa,d ISdIxM.PxL)E,, gSruomu,p (ugirnotu3p2)_,t ) | ^~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkT,, NRCeCdLO_pA,L GAOl_g#o#,a lPgroo,t oN>C(C)L._rPuRnO(TwOe_)#;# p r| o ^t o>().run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppc:l7S:h1m:e mnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herew ork) ;7 | \I M P| L ^_ COLL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562(:A15l:l Rnote: efield 'nthreads' will be initialized after field 'tidInBlock'd uce, C O562L | L N E T _tDiIdR(EtCiTd,) ,S InMtPhLrEe,a dSsu(mn,t hurienatd3s2)_,t )t i d| I^n Block/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t391h:r95e:a dnote: Iexpanded from macro 'IMPL_COLL_FUNC'd x.x), g391r | o u pR(ugnrWoourpk)<,n c c| l ^~~~~~~~~~~~~~~~~F unc#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:f562u:n60c:, note: tfield 'group' will be initialized after field 'stepSize'y pe, F u562n | c # # d etvirde(dtoipd<)t,y pnet>h,r eNaCdCsL(_nAtLhGrOe_a#d#sa)l,g ot,i dNICnCBLl_oPcRkO(TtOh_r#e#apdrIodtxo.>x()),. rgurno(u&pn(cgcrloSuhpm)e,m . w| o ^~~~~~~~~~~r k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| group(group | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 687 : 11s:t enote: pin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereS ize(n c687c | l S h m e m . c o m mp.rbiumfsf(Stiizde-st[iNdCSCtLa_rPtRBOcTaOs_tS,I MnPTLhEr]e/aNdCsCBLc_aSsTtE,P S&/dsiirzeecotf-(>To)u)t ,{ n u| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l p t| r group(group, args->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:b687u:f11f:, note: ain instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer gs->r e687c | v b u f f , | ^ prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t202i:d53-:t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS tar t202B | c a s t , n T hRruenaWdosrBkcEalsetm,e n&td oRuetd,O pn,u lAllpgtor,, Parrogtso->>(s)e.nrdubnu(fwfe,) ;a r g| s ^- >recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppb:u7f:f1,: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ^ 7 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:C202O:L53L:_ Fnote: Uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereN C(All R202e | d u c e , C O LRLuNnEWTo_rDkIERlEeCmTe,n tS()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:u391n:(95w:e )note: ;expanded from macro 'IMPL_COLL_FUNC' | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppR:u7n:W1o:r knote: , 2, 2>::run' requested heren cclF u7n | cI#M#PfLu_nCcO,L Lt_yFpUeN,C (FAulnlcR#e#dduecver,e dCoOpLD,I RNECCCTL,_ ASLIGMOP_L#E#,a lSguom,, NuCiCnLt_3P2R_OtT)O _ #| #^p roto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):.391r:u95n:( ¬e: nexpanded from macro 'IMPL_COLL_FUNC'c clShmem. w391o | r k )R;u n\W o r| k ^< nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hF:u562n:c15#:# fnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n c, t y562p | e , F utnicd#(#tdiedv)r,e dnotph(,n tNhCrCeLa_dAsL)G,O _t#i#daIlngBol,o cNkC(CtLh_rPeRaOdTIOd_x#.#xp)r,o tgor>o(u)p.(rgurno(u&pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r60k:) ;note: field 'group' will be initialized after field 'stepSize'\ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dnote: )field 'nthreads' will be initialized after field 'tidInBlock', nthrea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~x .x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ,warning: initializer order does not match the declaration order [-Wreorder-ctor]S um, uin t5623 | 2 _ t ) t i| d^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's (nthr e391a | d s )R,u ntWiodrIkn, N C563C | L _ A L GsOt_e#p#Sailzgeo(,n cNcClCSLh_mPeRmO.TcOo_m#m#.pbruoftfoS>i(z)e.sr[uNnC(C&Ln_cPcRlOSThOm_eSmI.MwPoLrEk])/;N C\C L _| S ^T EPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562o:f15(:T )note: )field 'nthreads' will be initialized after field 'tidInBlock' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 | | group(group tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,626 :n9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds(nt h626r | e a d s ) , t ipdrIinmBsl(otcikd(-tthirdeSatdaIrdtxS.cxa)t,t egrr,o unpT(hgrreoaudps)S,c a t| t ^~~~~~~~~~~~~~~~~e r, NU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562,: 60d:i rnote: efield 'group' will be initialized after field 'stepSize'c t->up ,562 | a r g s -t>isde(ntdibdu)f,f ,n tahrrgesa-d>sr(enctvhbruefafd,s ) ,| ^t idInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c202k:(53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea dIdx .202x | ) , g r o u p (RgurnoWuopr)k,E l e| m ^~~~~~~~~~~e nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666: 9562: | note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid( t666i | d ) , n t h r epardism(sn(tthirde,a dnsT)h,r etaiddsIGnaBtlhoecrk,( tdhirreeacdtI-d>xu.px,) ,N UgLrLo,u pa(rggrso-u>ps)e,n d b| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f f ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rgs->r e563c | v b u f fs,t e p| S ^i ze(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:S202h:m53e:m .note: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo mm.b u202f | f S i z e s [ N CRCuLn_WPoRrOkTEOl_eSmIeMnPtL ( )| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r u n| ( group(groupw e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp11::7 :note: 1in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6417 | | I M P L _ C O L L _pFrUiNmCs((AtlildR-etdiudcSet,a rCtORLeLdNuEcTe_,D InRTEhCrTe,a dSsIRMePdLuEc,e ,S udmi,r eucitn-t>3d2o_wtn), &| d^i rect-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:o391u:t95,: anote: rexpanded from macro 'IMPL_COLL_FUNC'g s->se n391d | b u fRfu,n Waorrgks<-n>crcelcFvubnucf#f#,f u n| c ^, type,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :F202u:n53c:# #note: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree vred o202p | < t y p e > , NRCuCnLW_oArLkGEOl_e#m#eanltgr(o)t.or>u(n)(.&rnucnc(lwSeh)m;e m .| w ^o rk); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp\: 8 :| 1 ^: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :8562 | :I15M:P Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'C OLL_FU N562C | ( A l l Rteiddu(ctei,d )C,O LnLtNhET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| ^562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 8 | tIiMdP(Lt_iCdO)L,L _nFtUhNrCe(aAdlsl(Rnetdhurceea,d sC)O,L LtNiEdTI_nDBIlRoEcCkT(,t hSrIeMaPdLIEd,x .Sxu)m,, girnotu6p4(_gtr)o u p| )^, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 95: note: expanded from macro 'IMPL_COLL_FUNC' 563 | s391t | e p RSuinzWeo(rnkcN,C CNLC_CSLT_EAPLSG/Os_i#z#eaolfg(oT,) )N C{C L _| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R O T| O group(group_ ##proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:)677.:r11u:n (note: &in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren cclSh m677e | m . w o r k ) ; \ p r| i ^m s(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:-562t:i15d:S tnote: afield 'nthreads' will be initialized after field 'tidInBlock'r tBcas t562, | n T h rteiadd(stBicda)s,t ,n t&hdrieraedcst(-n>tohurte,a ddsi)r,e ctti->down, args->sendbuffd,I naBrlgosc-k>(rtehcrvebaudfIfd,x . x| ) ^, group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 : note: field 'group' will be initialized after field 'stepSize' RunWo r562k | E l e m etnitd<(Ftni,d )T,, nRtehdrOepa,d sA(lngtoh,r ePardost)o,> (tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h^ :562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::15391:: 95warning: : initializer order does not match the declaration order [-Wreorder-ctor]note: expanded from macro 'IMPL_COLL_FUNC' 391 | 562 | R u n W otrikd<(ntcicdl)F,u nnct#h#rfeuandcs,( nttyhpree,a dFsu)n,c #t#iddeIvnrBeldoocpk<(ttyhpree>a,d INdCxC.Lx_)A,L GgOr_o#u#pa(lggroo,u pN)C,C L _| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R O T| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##pr o563t | o > ( ) .srtuenp(S&inzcec(lnSchcmleSmh.mweomr.kc)o;m m\. b u| f ^f Sizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h[:N562C:C15L:_ Pnote: Rfield 'nthreads' will be initialized after field 'tidInBlock'O TO_S I562M | P L E ] /tNiCdC(Lt_iSdT)E,P Sn/tshirzeeaodfs((Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | t group(groupi dInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d666I:d9x:. xnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, gro u666p | ( g r o u p ) , p r| i ^~~~~~~~~~~~~~~~~m s(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:,562 :n60T:h rnote: efield 'group' will be initialized after field 'stepSize'a dsGat h562e | r , d itriedc(tt-i>du)p,, nNtUhLrLe,a dasr(gnst-h>rseeanddsb)u,f ft,i daIrngBsl-o>crke(ctvhbruefafd,I d x| . ^x ), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ A562L | G O _ # #taildg(ot,i dN)C,C Ln_tPhRrOeTaOd_s#(#nptrhorteoa>d(s)).,r utni(d&InncBclloSchkm(etmh.rweoardkI)d;x .\x ) ,| ^g roup(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p15):, note: field 'nthreads' will be initialized after field 'tidInBlock'| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ti d563( | t i d ) ,s tnetphSriezaed(sn(cnctlhSrhemaedms.)c,o mtmi.dbIunfBflSoiczke(st[hNrCeCaLd_IPdRxO.TxO)_,S IgMrPoLuEp](/gNrCoCuLp_)S,T E P| S ^~~~~~~~~~~~~~~~~/ sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:(562T:)60): {note: field 'group' will be initialized after field 'stepSize' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i666d:)9,: nnote: tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh read s666( | n t h r e a d s )p,r itmisd(ItniBdl,o cnkT(htrheraedasdGIadtxh.exr),, dgirroeucpt(-g>ruopu,p )N,U L L| , ^~~~~~~~~~~ args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, NCCL:_562A:L15G:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# algo, NCCL_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B lock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'd x.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~~~~~~~_ SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E60]:/ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_ST E562P | S / s i zteiodf((tTi)d)) ,{ n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups (nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:n677B:l11o:c knote: (in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreadI d677x | . x ) , g r o u p (pgrriomusp()t,i d -| t ^~~~~~~~~~~i dStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d )N,C CnLt_hArLeGaOd_s#(#natlhgroe,a dNsC)C,L _tPiRdOITnOB_l#o#cpkr(otthor>e(a)d.Irduxn.(x&)n,c cglrSohumpe(mg.rwoourpk)),; \| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 563note: | field 'nthreads' will be initialized after field 'tidInBlock' st e562p | S i z e (tnicdc(ltSihdm)e,m .nctohmrme.abdusf(fnStihzreesa[dNsC)C,L _tPiRdOITnOB_lSoIcMkP(LtEh]r/eNaCdCILd_xS.TxE)P,S /gsriozuepo(fg(rTo)u)p ){, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677t:i11d:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nthr e677a | d s ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteBacdaIsdtx,. xn)T,h rgeraoduspB(cgarsotu,p )&,d i r| e ^~~~~~~~~~~c t->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:p562r:o15t:o >warning: (initializer order does not match the declaration order [-Wreorder-ctor]) .run(&ncc l562S | h m e m .twiodr(kt)i;d )\, n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:S562/:s15i:z ewarning: oinitializer order does not match the declaration order [-Wreorder-ctor]f (T)) { 562| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r655e:a11d:s (note: nin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads )655, | t i d I n B l o c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(gr o563u | p ) , s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e p S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)z e(ncc l563S | h m e m .sctoempmS.ibzuef(fnSciczleSsh[mNeCmC.Lc_oPmRmO.TbOu_fSfISMiPzLeEs][/NNCCCCLL__PSRTOETPOS_/SsIiMzPeLoEf](/TN)C)C L{_ S T| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P S /| s group(groupi zeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :{655 : 11| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| group(group 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677 : 11 : note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here prims( t677i | d - t idStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :warning: 391initializer order does not match the declaration order [-Wreorder-ctor]: 95: note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ P| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O TO_##p r563o | t o > ( )s.treupnS(i&znec(cnlcSchlmSehmm.ewmo.rcokm)m;. b\u f f| S ^i zes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Tnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'_ SIMP L562E | ] / N C CtLi_dS(TtEiPdS)/,s inztehorfe(aTd)s)( n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ) group(group, tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h677r:e11a:d Inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex .x), g r677o | u p ( g r o u p ) , p r| i ^~~~~~~~~~~~~~~~~m s(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:-562t:i60d:S tnote: afield 'group' will be initialized after field 'stepSize'r tBcas t562, | n T h rteiadd(stBicda)s,t ,n t&hdrieraedcst(-n>tohurte,a ddsi)r,e ctti-d>IdnoBwlno,c ka(rtghsr-e>asdeInddxb.uxf)f,, garrogusp-(>grreocuvpb)u,f f ,| ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkt,i dN(CtCiLd_)A,L GnOt_h#r#eaaldgso(,n tNhCrCeLa_dPsR)O,T Ot_i#d#IpnrBoltooc>k(()t.hrruena(d&Indcxc.lxS)h,m egmr.owuopr(kg)r;o u\p ) ,| ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : note: sfield 'nthreads' will be initialized after field 'tidInBlock't epSize (562n | c c l S htmiedm(.tciodm)m,. bnutfhfrSeiazdess([nNtChCrLe_aPdRsO)T,O _tSiIdMIPnLBEl]o/cNkC(CtLh_rSeTaEdPISd/xs.ixz)e,o fg(rTo)u)p ({g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ) ,| group(group | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hfield 'group' will be initialized after field 'stepSize': 677:11: note: 562in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | t i677d | ( t i d ) , n t h rperaidmss((nttihdr-etaiddsS)t,a rttiBdcIansBtl,o cnkT(htrheraedasdBIcdaxs.tx,) ,& dgirroeucpt(-g>roouutp,) ,d i r| e ^~~~~~~~~~~c t->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:e562m:e15n:t (n)t.hrruena(dwse()n;t h r| e ^a ds), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppi:d9I:n1B:l onote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herek (thr e9a | dIIMdPxL._xC)O,L Lg_rFoUuNpC((gArloluRpe)d,u c e| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ C O| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L NET_D I563R | E C T , sStIeMpPSLiEz,e (Snucmc,l Suhimnetm6.4c_otm)m . b| u^f fSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:s391[:N95C:C Lnote: _expanded from macro 'IMPL_COLL_FUNC'P ROTO_S I391M | P L ER]u/nNWCoCrLk_, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here> , NCC L687_ | A L G O _ # # a l g op/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,r iNmCsC:(L562t_:iP15dR:-O tTwarning: iOdinitializer order does not match the declaration order [-Wreorder-ctor]_S #t#aprrtoBtcoa>s(t),. rnu Tn562h( | r& en a cd cstlBiScdha(mstetim,d. )w&,od rinkrt)eh;cr te\-a d> so| (u ^ntt ,h rn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.heu:al562dl:sp15)t:,r ,tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'ad Irng Bs562l- | o> cs ke (n tdthbirudef(aftd,Iid dxa).r,xg )sn,-t >hgrrreeocauvpdb(sug(rfnoftu,hp r) e,| a ^d s| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : t202| i: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d53 I:n Bnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo c563 k | 202( | t h r set ea pd SI id zxRe.u(xnn)Wc,oc lrgSkhrEmoleuempm.(ecgnortmo E(t])i/.dNrC(utCniL(d_w)Se,T) E;nP tS /h| sr ^ie zaedo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppsf:((10Tn:)t1)h: r {enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested here d | s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ) 10, | | It group(groupMi PdLI_nCBOlLoLc_kF(UtNhCr(eAaldlI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hRd:ex677d.:ux11c):e, , note: gin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereCr OoLuLpN(Eg Tr677_o | Du Ip R )E ,C T , | ^~~~~~~~~~~S I MpPrLiEm,s (Stuimd,- thiadlSft)a r t| B^c ast,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n391T:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's Bcast, 391& | d i rReucntW-o>rokufduonwcn,, tayrpges,- >Fsuenncd#b#udfefv,r eadrogps<-t>yrpeec>v,b uNfCfC,L _ A| L ^G O_##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:,202 :N53C:C Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP ROT O202_ | # # p r o t o > (R)u.nrWuonr(k&EnlcecmleSnhtm (note: )field 'nthreads' will be initialized after field 'tidInBlock'. run(w e562) | ; | ^t id(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 9n:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered s(nt h9r | eIaMdPsL)_,C OtLiLd_IFnUBNlCo(cAkl(ltRherdeuacdeI,d xC.OxL)L,N EgTr_oDuIpR(EgCrTo,u pS)I,M P L| E ^~~~~~~~~~~~~~~~~, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:,562 :u60i:n tnote: 6field 'group' will be initialized after field 'stepSize'4 _t) | 562^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~ NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkI,n BNlCoCcLk_(AtLhGrOe_a#d#Iadlxg.ox,) ,N CgCrLo_uPpR(OgTrOo_u#p#)p,r o t| o ^~~~~~~~~~~~~~~~~> ().run(&nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:h562m:e60m:. wnote: ofield 'group' will be initialized after field 'stepSize'r k); \ | 562 ^ | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d15):, note: nfield 'nthreads' will be initialized after field 'tidInBlock't hread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~d x.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]t idInBloc k562( | t h r e atdiIdd(xt.ixd)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~s ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562641::1511:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | 562 | t i d ( tpirdi)m,s (nttihdr-etaiddsS(tnatrhtrReeaddusc)e,, tniTdhIrneBaldoscRke(dtuhcree,a ddIidrxe.cxt)-,> dgorwonu,p (&gdrioruepc)t,- > o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t , | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r gs->s e563n | d b u f fs,t eaprSgisz-e>(rnecccvlbSuhfmfe,m . c| o ^m m.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:S202i:z53e:s [note: Nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC CL_P R202O | T O _ S I M P L ER]u/nNWCoCrLk_ESlTeEmPeSn/ts(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):.641r:u11n:( wnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) ; | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 10 : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested herep rims (10t | iIdM-PtLi_dCSOtLaLr_tFRUeNdCu(cAel,l RneTdhurceea,d sCROeLdLuNcEeT,_ DdIiRrEeCcTt,- >SdIoMwPnL,E ,& dSiurme,c th-a>lofu)t , | a^r gs->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:e391n:d95b:u fnote: fexpanded from macro 'IMPL_COLL_FUNC', args- >391r | e c vRbuunfWfo,r k <| n ^c clFunc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:#202f:u53n:c ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret ype, 202F | u n c # # d e v rReudnoWpom,e nNtC_(#)#.prruont(ow>e());. r u| n ^( &nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppS:h11m:e1m:. wnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested herer k); \11 | I| M ^P L_CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562_:F15U:N Cnote: (field 'nthreads' will be initialized after field 'tidInBlock'A llRed u562c | e , C OtLiLdN(EtTi_dD)I,R EnCtTh,r eSaIdMsP(LnEt,h rSeuamd,s )f,l otaitd)I n B| l^o ck(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:I dnote: xexpanded from macro 'IMPL_COLL_FUNC'. x), gr o391u | p ( gRruonuWpo)r,k < n| c ^~~~~~~~~~~~~~~~~c lF/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562c:#60#:f unote: nfield 'group' will be initialized after field 'stepSize'c , typ e562, | F u n ct#i#dd(etvirde)d,o pnd,s (NnCtChLr_eAaLdGsO)_,# #taildgIon,Block(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,15 : | warning: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~initializer order does not match the declaration order [-Wreorder-ctor] | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562563 | | tsitde(ptSiidz)e,( nnctchlrSehamdesm(.nctohmrme.abdusf)f,S itziedsI[nNBClCoLc_kP(RtOhTrOe_aSdIIMdPxL.Ex])/,N CgCrLo_uSpT(EgPrSo/uspi)z,e o f| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T ) )| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T){ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 | | group(group stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hz:e666(:n9c:c lnote: Sin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh mem.c o666m | m . b u f f S i zpersi[mNsC(CtLi_dP,R OnTTOh_rSeIaMdPsLGEa]t/hNeCrC,L _dSiTrEePcSt/-s>iuzpe,o fN(UTL)L), {a r g| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- > s| e group(groupn dbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:r677g:s11-:> rnote: ein instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec vbuff ,677 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :p202r:i53m:s (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d-ti d202S | t a r t B c a s tR,u nnWTohrrkeEaldesmBecnatspo,u tA,l gdoi,r ePcrto-t>od>o(w)n.,r uanr(gwse-)>;s e n| d ^b uff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:g10s:-1>:r enote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herev buf f10, | I M| P ^L _COLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hU:N202C:(53A:l lnote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree duce, 202C | O L L N E T _ D IRRuEnCWTo,r kSEIlMePmLeEn,t expanded from macro 'IMPL_COLL_FUNC'( ).run (391w | e ) ;R u n| W ^o rk, 2, 2>::run' requested heren c, ty p10e | ,I MFPuLn_cC#O#LdLe_vFrUeNdCo(pAu,c eN,C CCLO_LALLNGEOT__#D#IaRlEgCoT,, NSCICMLP_LPER,O TSOu_m#,# phraoltfo)> ( )| .^r un(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c391c:l95S:h mnote: eexpanded from macro 'IMPL_COLL_FUNC'm .work); 391\ | | R ^u nWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:<562n:c15c:l Fnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n c##f u562n | c , t ytpied,( tFiudn)c,# #ndtehvrreeaddosp(a,d sN)C,C Lt_iAdLIGnOB_l#o#cakl(gtoh,r eNaCdCILd_xP.RxO)T,O _g#r#opurpo(tgor>o(u)p.)r,u n (| & ^~~~~~~~~~~~~~~~~n cc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m60e:m .note: wfield 'group' will be initialized after field 'stepSize'o rk); 562\ | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads(n t562h | r e a d st)i,d (ttiidd), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp), | : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 : 15| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| group(group | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641s:t11e:p Snote: iin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez e(nccl S641h | m e m . c o m m . b upfrfiSmisz(etsi[dN-CtCiLd_SPtRaOrTtOR_eSdIuMcPeL,E ]n/TNhCrCeLa_dSsTREePdSu/csei,z edoifr(eTc)t)- >{d o w| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, &| d group(groupi rect->out, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:g666s:-9>:s enote: nin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered buff ,666 | a r g s - > r e cpvrbiumfsf(,t i d| , ^ nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hG:a202t:h53e:r ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered irect -202> | u p , N U L L ,R uanrWgosr-k>EsleenmdebnutferdeOcpv,b uAflfg,o , | P ^r oto>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:r202u:n53(:w enote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here; | ^202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 12R:u1n:W onote: rin instantiation of member function 'RunWork, 2, 2>::run' requested herek Elem e12n | tIO(L)L.NrEuTn_(DwIeR)E;C T ,| ^S IMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 11S:u1m:, note: din instantiation of member function 'RunWork, 2, 2>::run' requested hereo ubl e11) | I M| P^L _COLL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hF:U391N:C95(:A lnote: lexpanded from macro 'IMPL_COLL_FUNC'R educe, C O391L | L N ERTu_nDWIoRrEkC ,391 | N C CRLu_nAWLoGrOk_<#n#cacllgFou,n cN#C#CfLu_nPcR,O TtOy_p#e#,p rFoutnoc>#(#)d.ervurne(d&onpce,m .NwCoCrLk_)A;L G\O _ #| # ^a lgo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_##p r562o | t o > ( )t.irdu(nt(i&dn)c,c lnSthhmreema.dwso(rnkt)h;r e\a d s| ) ^, tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t idInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k60(:t hnote: rfield 'group' will be initialized after field 'stepSize'e adIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~a ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hreadI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(group), 562| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(60t:i dnote: )field 'group' will be initialized after field 'stepSize', nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~p Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562m:.15c:o mwarning: minitializer order does not match the declaration order [-Wreorder-ctor]. buffSizes [562N | C C L _ PtRiOdT(Ot_iSdI)M,P LnEt]h/rNeCaCdLs_(SnTtEhPrSe/asdisz)e,o ft(iTd)I)n B{l o c| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t h| r group(groupe adIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :g677r:o11u:p (note: gin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oup), 677| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) p563r | i m s ( tsitde-ptSiidzSet(anrctcBlcSahsmte,m .ncTohmrme.abdusfBfcSaiszte,s [&NdCiCrLe_cPtR-O>ToOu_tS,I MdPiLrEe]c/tN-C>CdLo_wSnT,E PaSr/gssi-z>esoefn(dTb)u)f f{, a| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g s -| > group(groupr ecvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677| : ^11 : note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :677 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s ( tRiudn-WtoirdkSEtlaermteBncta-(>)o.urtu,n (dwier)e;c t -| > ^d own, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:g11s:-1>:s enote: nin instantiation of member function 'RunWork, 2, 2>::run' requested hered buff ,11 | aIrMgPsL-_>CrOeLcLv_bFuUfNfC,( A l| l ^R educe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202C:O53L:L Nnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT _DIR E202C | T , S I M P L ER,u nSWuomr,k Efllemento( )391. | r u nR(uwneW)o;r k <| n ^c clFun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppc:#10#:f1u:n cnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here type ,10 | FIuMnPcL#_#CdOeLvLr_eFdUoNpC<(tAylpleR>e,d uNcCeC,L _CAOLLGLON_E#T#_aDlIgRoE,C TN,C CSLI_MPPRLOET,O _S#u#mp,r ohtaol>f()) . r| u^n (&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:c391l:S95h:m enote: mexpanded from macro 'IMPL_COLL_FUNC'. work); 391\ | | R ^u nWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562<:n15c:c lnote: Ffield 'nthreads' will be initialized after field 'tidInBlock'u nc##f u562n | c , t ytpied,( tFiudn)c,# #ndtehvrreeaddosp(a,d sN)C,C Lt_iAdLIGnOB_l#o#cakl(gtoh,r eNaCdCILd_xP.RxO)T,O _g#r#opurpo(tgor>o(u)p.)r,u n (| & ^~~~~~~~~~~~~~~~~n cc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m60e:m .note: wfield 'group' will be initialized after field 'stepSize'o rk); \562 | | ^ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i15d:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'n threa d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~x .x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m15e:m .warning: winitializer order does not match the declaration order [-Wreorder-ctor]o rk); \ 562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d x.x )563, | g r o uspt(egprSoiuzpe)(,n c c| l ^~~~~~~~~~~~~~~~~S hmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:o562m:m60.:b unote: ffield 'group' will be initialized after field 'stepSize'f Sizes[N C562C | L _ P R OtTiOd_(StIiMdP)L,E ]n/tNhCrCeLa_dSsT(EnPtSh/rseiazdeso)f,( Tt))i d{I n B| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o c k| ( group(groupt hreadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)626,: 9g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (gro u626p | ) , | ^~~~~~~~~~~ prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562,: 15a:r gwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]- >sendbuff, a562r | g s - > rteicdv(btuifdf),, n| t ^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o , | P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oto> (563) | . r u n (swtee)p;S i z| e ^( ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppm:e13m:.1c:o mnote: min instantiation of member function 'RunWork, 2, 2>::run' requested here. buf f13S | iIzMePsL[_NCCOCLLL__PFRUONTCO(_ASlIlMRPeLdEu]c/eN,C CCLO_LSLTNEEPTS_/DsIiRzEeCoTf,( TS)I)M P{L E ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S u m| , group(group rccl_bfloa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:16666:)9 : | note: ^in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :66695 | : note: expanded from macro 'IMPL_COLL_FUNC' pr i391m | s ( tRiudn,W onrTkh,u pF,u nNcU#L#Ld,e varregdso-p>u,f fN,C CaLr_gAsL-G>Or_e#c#vablugfof,, N C| C ^L _PROTO_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o202t:o53>:( )note: .in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer un(&n c202c | l S h m e m . w oRrukn)W;o r\k E l| e ^m entd()),. rnutnh(rweea)d;s ( n| t ^hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h), gr:o562u:p15(:g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56260 | : note: field 'group' will be initialized after field 'stepSize' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), | ^~~~~~~~~~~563 | stepSize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562c:o15m:m .warning: binitializer order does not match the declaration order [-Wreorder-ctor]u ffSizes [562N | C C L _ PtRiOdT(Ot_iSdI)M,P LnEt]h/rNeCaCdLs_(SnTtEhPrSe/asdisz)e,o ft(iTd)I)n B{l o c| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t h| r group(groupe adIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:(641g:r11o:u pnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 641| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | p rsitmesp(Stiizde-(tnicdcSltSahrmteRme.dcuocmem,. bnuTfhfrSeiazdessR[eNdCuCcLe_,P RdOiTrOe_cStI-M>PdLoEw]n/,N C&CdLi_rSeTcEtP-S>/osuitz,e oafr(gTs)-)> s{e n d| b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u f f| , group(group args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:c677v:b11u:f fnote: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : pnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei ms(t i202d | - t i d S t a r tRBucnaWsotr,k EnlTehmreenatdgoou,t ,P rdoitroe>c(t)-.>rduonw(nw,e )a;r g s| - ^> sendbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppf:,11 :a1r:g snote: -in instantiation of member function 'RunWork, 2, 2>::run' requested here> recv b11u | fIfM,P L _| C ^O LL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202(:A53l:l Rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered uce ,202 | C O L L N E T _ DRIuRnEWCoTr,k ESlIeMmPeLnEt,< FSnu,m ,T ,f lRoeadtO)p , | A^l go, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391t:o95>:( )note: .expanded from macro 'IMPL_COLL_FUNC'r un(we); 391 | | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:k11<:n1c:c lnote: Fin instantiation of member function 'RunWork, 2, 2>::run' requested hereu nc# #11f | uInMcP,L _tCyOpLeL,_ FFUuNnCc(#A#ldleRverdeudcoep,< tCyOpLeL>N,E TN_CDCILR_EACLTG,O _S#I#MaPlLgEo,, SNuCmC,L _fPlRoOaTtO)_ # #| p^r oto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:)391.:r95u:n (note: &expanded from macro 'IMPL_COLL_FUNC'n cclSh m391e | m . wRournkW)o;r k\< n c| c ^l Func##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:u562n:c15,: tnote: yfield 'nthreads' will be initialized after field 'tidInBlock'p e, Fu n562c | # # d e vtriedd(otpit,h rNeCaCdLs_(AnLtGhOr_e#a#dasl)g,o ,t iNdCICnLB_lPoRcOkT(Ot_h#r#epardoItdox>.(x)).,r ugnr(o&unpc(cglrSohumpe)m,. w o| r ^~~~~~~~~~~~~~~~~k ); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h\: 562 :| 60 ^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~r oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC(CnLt_hArLeGaOd_s#)#, tiadlIgnoB,l oNcCkC(Lt_hPrReOaTdOI_d#x#.pxr)o,t og>r(o)u.pr(ugnr(o&unpc)c,l S h| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m .| w tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rk); \ | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e15p:S inote: zfield 'nthreads' will be initialized after field 'tidInBlock'e (nccl S562h | m e m . ctoimdm(.tbiudf)f,S inztehsr[eNaCdCsL(_nPtRhOrTeOa_dSsI)M,P LtEi]d/INnCBClLo_cSkT(EtPhSr/esaidzIedoxf.(xT)),) g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( g r| o group(groupu p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h60::677 :note: 11field 'group' will be initialized after field 'stepSize': note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 677 | t i d ( t i d ) , nptrhirmesa(dtsi(dn-tthirdeSatdasr)t,B ctaisdtI,n BnlTohcrke(atdhsrBecaadsItd,x .&xd)i,r egcrto-u>po(ugtr,o udpi)r,e c t| - ^~~~~~~~~~~> down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]( threadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBlo c563k | ( t h r esatdeIpdSxi.zxe)(,n cgcrloSuhpm(egmr.ocuopm)m,. b u| f ^~~~~~~~~~~f Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rgs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 562: | warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up), | 563 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) step S563i | z e ( n csctleSphSmiezme.(cnocmcml.SbhumfefmS.iczoemsm[.NbCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h[:N562C:C15L:_ Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]O TO_SIMPLE ]562/ | N C C L _tSiTdE(PtSi/ds)i,z enotfh(rTe)a)d s{( n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups ), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o641c:k11(:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree adIdx. x641) | , g r o u p ( g r opurpi)m,s ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d - t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Start R563e | d u c stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hrk);: 562\: 15 :| ^warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), 563 | | ^~~~~~~~~~~~~~~~~ s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S60i:z enote: (field 'group' will be initialized after field 'stepSize'n cclS h562m | e m . c otmimd.(btuifdf)S,i znetsh[rNeCaCdLs_(PnRtOhTrOe_aSdIsM)P,L Et]i/dNICnCBLl_oScTkE(PtSh/rseiazdeIodfx(.Tx))), {g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ( g| r group(groupo up), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | Ru:n562W:o15r:k ),, NtCiCdLI_nABLlGoOc_k#(#tahlrgeoa,d INdCxC.Lx_)P,R OgTrOo_u#p#(pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n ( &| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c clShm e563m | . w o r ks)t;e p\S i z| e ^( ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562m:.15c:o mnote: mfield 'nthreads' will be initialized after field 'tidInBlock'. buffS i562z | e s [ N CtCiLd_(PtRiOdT)O,_ SnItMhPrLeEa]d/sN(CnCtLh_rSeTaEdPsS)/,s itziedoIfn(BTl)o)c k{( t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| I group(groupd x.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:)655,: 11 :| ^~~~~~~~~~~~~~~~~note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 655note: | field 'group' will be initialized after field 'stepSize' 562 | p r itmisd((ttiidd-)t,i dnSttharretaRdesd(uncteh,r enaTdhsr)e,a dtsiRdeIdnuBcleo,c kn(utlhlrpetard,I d&xd.ixr)e,c tg-r>oouupt(,g raorugps)-,> s e| n ^~~~~~~~~~~d buff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:p15r:o twarning: oinitializer order does not match the declaration order [-Wreorder-ctor]> ().run(&n c562c | l S h m etmi.dw(otrikd));, \n t h| r ^e ads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: )field 'nthreads' will be initialized after field 'tidInBlock', tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ^~~~~~~~~~~~~~~~~b uff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60s:[ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PRO T562O | _ S I M PtLiEd](/tNiCdC)L,_ SnTtEhPrSe/asdisz(enotfh(rTe)a)d s{) , | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d I| n group(groupB lock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a655d:I11d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , grou p655( | g r o u p ) , | ^~~~~~~~~~~p rims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N562C | C L _ A LtGiOd_(#t#iadl)g,o ,n tNhCrCeLa_dPsR(OnTtOh_r#e#apdrso)t,o >t(i)d.IrnuBnl(o&cnkc(ctlhSrhemaedmI.dwxo.rxk)),; g\r o u| p ^( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 15 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s t etpiSdi(ztei(dn)c,c lnSthhmreema.dcso(mnmt.hbruefafdSsi)z,e st[iNdCICnLBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:,562 :t15i:d Iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]B lock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ), ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~[ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _PROT O563_ | S I M P LsEt]e/pNSCiCzLe_(SnTcEcPlSS/hsmiezme.ocfo(mTm).)b u{f f S| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e s| [ group(groupN CCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:S916I:M7P:L Enote: ]in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here/ NCCL_ S916T | E P S / s i zperoifm(sT()g)r o{u p T| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d , | g group(groupr oupNthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a916d:s7,: ¬e: rin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested heree cv, & s916e | n d , a r gpsr-i>msse(ngdrbouufpfT,i da,r ggsr-o>urpeNctvhbruefafd,s , | & ^r ecv, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:s202e:n53d:, note: ain instantiation of member function 'RunWorkElement, 3, 2>::run' requested herer gs-> s202e | n d b u f f , aRrugnsW-o>rrkeEclvebmuefnft,< F n| , ^ T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:d202O:p53,: Anote: lin instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereg o, P r202o | t o > ( ) . r u nR(uwneW)o;r k E| l ^e ment, 3, 2>::run' requested hered Op, A5l | gIoM,P LP_rCoOtLoL>_(F)U.NrCu(nA(lwleR)e;d u c| e ^, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cppE:T7_:C1H:A Inote: Nin instantiation of member function 'RunWork, 3, 2>::run' requested here, SIM P7L | EI,M PMLa_xC,O LuLi_nFtU8N_Ct()A l l| R^e duce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :C391O:L95L:N Enote: Texpanded from macro 'IMPL_COLL_FUNC'_ CHAIN, 391S | I M PRLuEn,W oMrakx<,n cucilnFtu3n2c_#t#)f u n| c^, typ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:,391 :F95u:n cnote: #expanded from macro 'IMPL_COLL_FUNC'# devredo p391< | t y pReu>n,W oNrCkCd(o)p.n,c cNlCSChLm_eAmL.GwOo_r#k#)a;l g\o , | N ^C CL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:O562T:O15_:# #note: pfield 'nthreads' will be initialized after field 'tidInBlock'r oto>( )562. | r u n ( &tnicdc(ltSihdm)e,m .nwtohrrke)a;d s\( n t| h ^r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i60d:I nnote: Bfield 'group' will be initialized after field 'stepSize'l ock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~e ad/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~d s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d Idx.x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562t:,15 :a rwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]s ->sendbuff, 562a | r g s - >triedc(vtbiudf)f,, n t| h ^r eads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tidI n202B | l o c k ( t h r eRaudnIWdoxr.kxE)l,e mgernotu ( ) .srtuenp(Swiez)e;( n c| c ^l Shmem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppo:m4m:.1b:u fnote: fin instantiation of member function 'RunWork, 2, 2>::run' requested hereS izes [4N | CICMLP_LP_RCOOTLOL__SFIUMNPCL(EA]l/lNRCeCdLu_cSeT,E PCSO/LsLiNzEeTo_fD(ITR)E)C T{, S| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| E group(group, Prod, int8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| :^687 :11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 687 | 391 | R u npWroirmks<(ntcicdl-FtuindcS#t#afrutnBcc,a stty,p en,T hFruenacd#s#Bdceavsrte, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hIdx.x:)562,: 15g:r owarning: uinitializer order does not match the declaration order [-Wreorder-ctor]p (group), | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t60i:d (note: tfield 'group' will be initialized after field 'stepSize'i d), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(g r563o | u p ) , s t| e ^~~~~~~~~~~p Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h \ | : ^562 :15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hinitializer order does not match the declaration order [-Wreorder-ctor]: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60 :563 | note: field 'group' will be initialized after field 'stepSize' ste p562S | i z e ( ntcicdl(Sthimde)m,. cnotmhmr.ebaudfsf(Snitzherse[aNdCsC)L,_ PtRiOdTIOn_BSlIoMcPkL(Et]h/rNeCaCdLI_dSxT.ExP)S,/ sgirzoeuopf((gTr)o)u p{) , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL_AL:G562O:_15#:# awarning: linitializer order does not match the declaration order [-Wreorder-ctor]g o, NCCL_PROT O562_ | # # p r ottiod>((t)i.dr)u,n (n&tnhcrcelaSdhsm(enmt.hwroerakd)s;) ,\ t i| d ^I nBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Inote: dfield 'nthreads' will be initialized after field 'tidInBlock'x .x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads( n563t | h r e a dsst)e,p StiizdeI(nnBclcolcSkh(mtehmr.ecaodmImd.xb.uxf)f,S igzreosu[pN(CgCrLo_uPpR)O,T O _| S ^~~~~~~~~~~~~~~~~I MPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:]562/:N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'S TEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i626d:I9n:B lnote: oin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec k(thr e626a | d I d x . x ) , pgrriomusp((tgirdo-utpi)d,S t a| r ^~~~~~~~~~~t Scatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6015:: note: warning: field 'group' will be initialized after field 'stepSize'initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~r oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :p1r: iIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:(10t: iIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h-:t167i: d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:t562a:r15t:B cwarning: ainitializer order does not match the declaration order [-Wreorder-ctor]s t, nThreads B562c | a s t , t&iddi(rteicdt)-,> onutth,r edaidrse(cntt-h>rdeoawdns,) ,a rtgisd-I>nsBelnodcbku(ftfh,r eaardgIsd-x>.rxe)c,v bgurfofu,p ( g| r ^o up), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 53 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 563202 | | s t e p SRiuzneW(onrckcEllSehmmeenmt.S(I)M.PrLuEn](/wNeC)C;L _ S| T ^E PS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppi:z5e:o1f:( Tnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here) { 5| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I M P| L group(group_ COLL_FUNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:d626u:c9e:, note: Cin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO LLNET_D I626R | E C T , S I M PpLrEi,m s(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N562C | C L _ A LtGiOd_(#t#iadl)g,o ,n tNhCrCeLa_dPsR(OnTtOh_r#e#apdrso)t,o >t(i)d.IrnuBnl(o&cnkc(ctlhSrhemaedmI.dwxo.rxk)),; g\r o u| p ^( group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~15 : | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s tteipdS(itzied()n,c cnltShhrmeeamd.sc(onmtmh.rbeuafdfsS)i,z etsi[dNICnCBLl_oPcRkO(TtOh_rSeIaMdPILdEx]./xN)C,C Lg_rSoTuEpP(Sg/rsoiuzpe)o,f ( T| ) ^~~~~~~~~~~~~~~~~) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 60 :| group(groupnote: field 'group' will be initialized after field 'stepSize' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i666d:)9,: nnote: tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads (666n | t h r e a d s ) ,p rtiimdsI(ntBildo,c kn(TthhrreeaaddsIGdaxt.hxe)r,, gdrioruepc(tg-r>ouupp,) ,N U L| L ^~~~~~~~~~~, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorka,d sN(CnCtLh_rAeLaGdOs_)#,# atligdoI,n BNlCoCcLk_(PtRhOrTeOa_d#I#dpxr.oxt)o,> (g)r.oruupn((g&rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. w o| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k ); \ 563| | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e15(:n cnote: cfield 'nthreads' will be initialized after field 'tidInBlock'l Shmem. c562o | m m . b utfifdS(itzieds)[,N CnCtLh_rPeRaOdTsO(_nStIhMrPeLaEd]s/)N,C CtLi_dSITnEBPlSo/cski(ztehorfe(aTd)I)d x{. x )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ g r| o group(groupu p(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 655 ^~~~~~~~~~~~~~~~~: 11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 :60: note: field 'group' will be initialized after field 'stepSize'655 | 562 | ptriidm(st(id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkt,i dNSCtCaLr_tARLeGdOu_c#e#,a lngToh,r eNaCdCsLR_ePdRuOcTeO,_ #n#uplrlopttor>,( )&.driurne(c&tn-c>coluSth,m eamr.gwso-r>ks)e;n d\b u f| f ^, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562r:e15c:v bnote: ufield 'nthreads' will be initialized after field 'tidInBlock'f f, | ^562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s(nth r202e | a d s ) , t i dRIunnBWloorckkE(ltehmreenatd().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562(:w60e:) ;note: field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp : 4 : 1t:i dnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heret id) ,4 | nItMhPrLe_aCdOsL(Ln_tFhUrNeCa(dAsl)l,R etdiudcIen,B lCoOcLkL(NtEhTr_eDaIdRIEdCxT.,x )S,I MgPrLoEu,p (Pgrroodu,p )i,n t 8| _ ^~~~~~~~~~~t ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p15(:g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 60 : tnote: ifield 'group' will be initialized after field 'stepSize'd (tid )562, | n t h tid(tirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562p:r15i:m swarning: (initializer order does not match the declaration order [-Wreorder-ctor]t id-tidSt a562r | t S c a tttiedr(,t indT)h,r enatdhsrSecaadtst(enrt,h rNeUaLdLs,) ,d itriedcItn-B>luopc,k (atrhgrse-a>dsIednxd.bxu)f,f ,g raorugps(-g>rroeucpv)b,u f f| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :563202 | : 53 : note: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret epSi z202e | ( n c c l S h m eRmu.ncWoomrmk.EbluefmfeSnitz_(S)T.ErPuSn/(swiez)e;o f (| T ^) ) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~6 : 1| : group(group note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | :I687M:P11L:_ Cnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL L_FUNC (687A | l l R e d u c e , CpOrLiLmNsE(Tt_iDdI-RtEiCdTS,t aSrItMBPcLaEs,t ,P rnoTdh,r eiandts3B2c_ats)t , | &^d irec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:-391>:o95u:t ,note: nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562B:c15a:s twarning: ,initializer order does not match the declaration order [-Wreorder-ctor] &dire c562t | - > o u tt,i dn(utlildp)t,r ,n tahrrgesa-d>ss(enntdhbruefafd,s )a,r gtsi-d>IrneBclvobcukf(ft,h r e| a ^d Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202g:r53o:u pnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg roup )202, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RunWo r563k | E l e m esnttei(z)e.sr[uNnC(CwLe_)P;R O T| O ^_ SIMPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppN:C4C:L1_:S Tnote: Ein instantiation of member function 'RunWork, 2, 2>::run' requested hereP S/si z4e | oIfM(PTL)_)C O{L L _| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~U N C| ( group(groupA llReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:O655L:L11N:E Tnote: _in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereD IRECT ,655 | S I M P L E , P r opdr,i misn(tt8i_dt-)t i d| S^t artRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:u391c:e95,: nnote: Texpanded from macro 'IMPL_COLL_FUNC'h readsR e391d | u c eR,u nnWuolrlkpnocu,t ,t yapreg,s -F>usnecn#d#bduefvfr,e daorpgpree>c,v bNuCfCfL,_ A L| G ^O _##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202N:C53C:L _note: Pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR OTO_ #202# | p r o t o > ( )R.urnuWno(r&knEclcelmSehnmtefield 'nthreads' will be initialized after field 'tidInBlock'( ).run (562w | e ) ; t| i ^d (tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppn:t4h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here( nthr e4a | dIsM)P,L _tCiOdLILn_BFlUoNcCk((AtlhlrReeadduIcdex,. xC)O,L LgNrEoTu_pD(IgRrEoCuTp,) ,S I M| P ^~~~~~~~~~~~~~~~~L E, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:r562o:d60,: inote: nfield 'group' will be initialized after field 'stepSize't 8_t) 562| | ^ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthre a391d | s ( nRtuhnrWeoardks<)n,c ctliFduInncB#l#ofcukn(ct,h rteyapdeI,d xF.uxn)c,# #gdreovurpe(dgorpo ,| ^~~~~~~~~~~N CCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h15::202 :warning: 53initializer order does not match the declaration order [-Wreorder-ctor]: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562202 | | t RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hlShme:m562.:w15o:r kwarning: )initializer order does not match the declaration order [-Wreorder-ctor]; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56215 | : note: field 'nthreads' will be initialized after field 'tidInBlock' tid(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), | 563 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t60e:p Snote: ifield 'group' will be initialized after field 'stepSize'z e(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), group (562g | r o u p )t,i d (| t ^~~~~~~~~~~i d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nWorkElemelnetmo(t)o.>r(u)n.(rwuen)(;w e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp4::61::1 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 6I | MIPMLP_LC_OCLOLL_LF_UFNUCN(CA(lAllRleRdeudcuec,e ,C OCLOLLNLENTE_TD_IDRIERCETC,T ,S ISMIPMLPEL,E ,P rPordo,d ,i nitn8t_3t2)_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h391::39195::95 :note: expanded from macro 'IMPL_COLL_FUNC'note: expanded from macro 'IMPL_COLL_FUNC' 391 | 391 | R uRnuWnoWrokre,> ,N CNCCLC_LA_LAGLOG_O#_##a#laglog,o ,N CNCCLC_LP_RPORTOOT_O#_##p#rportoot>o(>)(.)r.urnu(n&(n&cncclcSlhSmhemme.mw.owrokr)k;) ;\ \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); ims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tPROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'group' will be initialized after field 'stepSize'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~r oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | i Rdu)n,W onrtkhErleeamdesn(tnd(I)d.xr.uxn)(,w eg)r;o u p| ( ^g roup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp,: 5 :| 1 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 5 | IMP L563_ | C O L L _sFtUeNpCS(iAzlel(RnecdculcSeh,m eCmO.LcLoNmEmT._bDuIfRfESCiTz,e sS[INMCPCLLE_,P RPOrToOd_,S IuMiPnLtE8]_/tN)C C L| _^S TEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/:s391i:z95e:o fnote: (expanded from macro 'IMPL_COLL_FUNC'T )) { | 391 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | R group(groupu nWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF unc##de v641r | e d o p < t y p e > ,p rNiCmCsL(_tAiLdG-Ot_i#d#Satlagrot,R eNdCuCcLe_,P RnOTThOr_e#a#dpsrRoetdou>c(e),. rduinr(e&cntc-c>ldSohwmne,m .&wdoirrke)c;t -\> o u| t ^, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562s:e15n:d bnote: ufield 'nthreads' will be initialized after field 'tidInBlock'f f, ar g562s | - > r e ctvibdu(ftfi,d ) ,| ^n threads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | :t562i:d15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]) , nthreads(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~e adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgo, N:C562C:L15_:P Rwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]T O_##proto>() .562r | u n ( & ntcicdl(Sthimde)m,. wnotrhkr)e;a d\s ( n| t ^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~~~~~~~z es/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h[:N562C:C60L:_ Pnote: Rfield 'group' will be initialized after field 'stepSize'O TO_S I562M | P L E ] /tNiCdC(Lt_iSdT)E,P Sn/tshirzeeaodfs((Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | t group(groupi dInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d677I:d11x:. xnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, grou p677( | g r o u p ) , | ^~~~~~~~~~~p rims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| ^ :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h15::391 :warning: 95initializer order does not match the declaration order [-Wreorder-ctor]: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 562 | R u n Wtoirdk(a,d INdCxC.Lx_)A,L GgOr_o#u#pa(lggroo,u pN)C,C L _| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R O T| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##pr o563t | o > ( ) .srtuenp(S&inzcec(lnSchcmleSmh.mweomr.kc)o;m m\. b u| f ^f Sizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_SIMP L562E | ] / N C CtLi_dS(TtEiPdS)/,s inztehorfe(aTd)s)( n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ) group(group, tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a666d:I9d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , gr o666u | p ( g r o u p ) ,p r i| m ^~~~~~~~~~~~~~~~~s (ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:,562 :n60T:h rnote: efield 'group' will be initialized after field 'stepSize'a dsGat h562e | r , d itriedc(tt-i>du)p,, nNtUhLrLe,a dasr(gnst-h>rseeanddsb)u,f ft,i daIrngBsl-o>crke(ctvhbruefafd,I d x| . ^x ), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' TO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :s15t:e pwarning: Sinitializer order does not match the declaration order [-Wreorder-ctor]i ze(ncclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~641 : 11| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 641 | s t e p S i z e ( n cpcrliSmhsm(etmi.dc-otmimd.SbtuafrftSRiezdeusc[eN,C CnLT_hPrReOaTdOs_RSeIdMuPcLeE,] /dNiCrCeLc_tS-T>EdPoSw/ns,i z&edoifr(eTc)t)- >{o u t| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a r| g group(groups ->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f626f:,9 :a rnote: gin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres ->rec v626b | u f f , | ^ prims(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h-:t202i:d53S:t anote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret Scat t202e | r , n T h r e aRdusnSWcoartktEelre,m eNnUtLOupp,, Aalrggos,- >Psreontdob>u(f)f.,r uanr(gwse-)>;r e c| v ^b uff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp ^: 5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53 :5 | note: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereM PL_C O202L | L _ F U N C ( A lRluRneWdourckeE,l eCmOeLnLtNi(n)t.8r_utn)( w e| )^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppnote: :expanded from macro 'IMPL_COLL_FUNC'6 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 391 | R6u | nIWMoPrLk_E,, NPCrCoLd, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &d_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ irect->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.wor:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), | ^~~~~~~~~~~~~~~~~ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :s60t:e pnote: Sfield 'group' will be initialized after field 'stepSize'i ze(n c562c | l S h m etmi.dc(otmimd.)b,u fnftShirzeeasd[sN(CnCtLh_rPeRaOdTsO)_,S ItMiPdLIEn]B/lNoCcCkL(_tShTrEePaSd/Isdixz.exo)f,( Tg)r)o u{p ( g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ) group(group, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkt,h rNeCaCdLs_(AnLtGhOr_e#a#dasl)g,o ,t iNdCICnLB_lPoRcOkT(Ot_h#r#epardoItdox>.(x)).,r ugnr(o&unpc(cglrSohumpe)m,. w o| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ) ;| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)\ | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t15e:p Snote: ifield 'nthreads' will be initialized after field 'tidInBlock'z e(ncclS h562m | e m . c otmimd.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbufft,a r t| S ^c atter,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202T:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres Scat t202e | r , N U L L , RduinrWeocrtk-E>luepm,e natr sTe,n dRbeudfOfp,, aArlggso-,> rPercovtbou>f(f),. r u| n ^( we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h7::2021::53 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 7 | I M202P | L _ C O L L _ F URNuCn(WAolrlkREeldeumceen,t o(d),. ruuinn(tw3e2)_;t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::6391::195:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereexpanded from macro 'IMPL_COLL_FUNC' 6 | I391M | P L _RCuOnLWLo_rFkU,, iNnCtC3L2__AtL)G O _| #^# algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391N:C95C:L _note: Pexpanded from macro 'IMPL_COLL_FUNC'R OTO_## p391r | o t oR>u(n)W.orrukn<(n&cncclcFluSnhcm#e#mf.uwnocr,k )t;y p\e , | F ^u nc##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:e562v:r15e:d onote: pfield 'nthreads' will be initialized after field 'tidInBlock'< type> ,562 | N C C L _tAiLdG(Ot_i#d#)a,l gnot,h rNeCaCdLs_(PnRtOhTrOe_a#d#sp)r,o ttoi>d(I)n.Brluonc(k&(ntchcrleSahdmIedmx..wxo)r,k )g;r o\u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~~~~~~~~~~~: 15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 60: note: field 'group' will be initialized after field 'stepSize' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~562 :60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562655: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] pr i562m | s ( t i dt-itdi(dtSitda)r,t Rneduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d (sttiedp)S,i znet(hnrcecaldSsh(mnetmh.rceoamdms.)b,u ftfiSdiIzneBsl[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ]g/rNoCuCpL(_gSrToEuPpS)/,s i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o f (| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 563 | | group(group stepSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n666c:c9l:S hnote: min instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.co 666 | m m . bpurfifmSsi(zteisd[,N CnCTLh_rPeRaOdTsOG_aStIhMePrL,E ]d/iNrCeCcLt_-S>TuEpP,S /NsUiLzLe,o fa(rTg)s)- >{s e n| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~b u f| f group(group, args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:v655b:u11f:f ,note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here prim s202( | t i d - t i d S tRaurntWRoerdkuEclee,m ennTthe(c)t.-r>uonu(tw,e )a;r g s| - ^> sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppu:f6f:,1 :a rnote: gin instantiation of member function 'RunWork, 2, 2>::run' requested heres ->re c6v | bIuMfPfL,_ C O| L ^L _FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:R202e:d53u:c enote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here COLL N202E | T _ D I R E C T ,R uSnIWMoPrLkEE,l ePmreondt,< Finn,t 3T2,_ tR)e d O| p^, Alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:,391 :P95r:o tnote: oexpanded from macro 'IMPL_COLL_FUNC'> ().run (391w | e ) ;R u n| W ^o rk, 2, 2>::run' requested herec , ty p6e | ,I MFPuLn_cC#O#LdLe_vFrUeNdCo(pAu,c eN,C CCLO_LALLNGEOT__#D#IaRlEgCoT,, NSCICMLP_LPER,O TPOr_o#d#,p riontto3>2(_)t.)r u n| (^& ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m391e:m95.:w onote: rexpanded from macro 'IMPL_COLL_FUNC'k ); \ | ^391 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:W562o:r15k:< nnote: cfield 'nthreads' will be initialized after field 'tidInBlock'c lFunc #562# | f u n c ,t itdy(ptei,d )F,u nnct#h#rdeeavdrse(dnotph),, NtCiCdLI_nABLlGoOc_k#(#tahlrgeoa,d INdCxC.Lx_)P,R OgTrOo_u#p#(pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~n (&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S60h:m enote: mfield 'group' will be initialized after field 'stepSize'. work) ;562 | \ | ^t id(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds(nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~h readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562T:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s Reduce, direct->down, & d562i | r e c t -t>iodu(tt,i da)r,g sn-t>hsreenaddbsu(fnft,h raeragdss-)>,r etcivdbIunfBfl,o c k| ( ^t hreadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.202x:)53,: gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up(g r202o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)W orkEl e563m | e n t < Fsnt,e pTS,i zRee(dnOcpc,l SAhlmgeom,. cPormomt.ob>u(f)f.Sriuzne(sw[eN)C;C L _| P ^R OTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppP:L6E:]1/:N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _STE P6S | /IsMiPzLe_oCfO(LTL)_)F U{N C (| A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l l R| e group(groupd uce, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:E666T:_9D:I Rnote: Ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC T, SI M666P | L E , P r o d ,p riinmts3(2t_itd), n| T^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:G391a:t95h:e rnote: ,expanded from macro 'IMPL_COLL_FUNC' direc t391- | > u pR,u nNWUoLrLk,< nacrcglsF-u>nsce#n#dfbuunfcf,, tayrpges,- >Fruenccv#b#udfefv,r e d| o ^p ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :N202C:C53L:_ Anote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereG O_## a202l | g o , N C C L _RPuRnOWToOr_k#E#lpermoetnot><(F)n.,r uTn,( &RnecdcOlpS,h mAelmg.ow,o rPkr)o;t o\> ( )| . ^r un(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:e562):;15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp | : 6 : 1 :t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here( tid), 6n | tIhMrPeLa_dCsO(LnLt_hFrUeNaCd(sA)l,l RteidduIcneB,l oCcOkL(LtNhErTe_aDdIIRdExC.Tx,) ,S IgMrPoLuEp,( gPrrooudp,) ,i n t| 3 ^~~~~~~~~~~~~~~~~2 _t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562 :| 60^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::20215::53 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. x )| , ^ group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:g6r:o1u:p )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 6| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I MPL_C O563L | L _ F U NsCt(eAplSliRzeed(unccec,l SChOmLeLmN.EcTo_mDmI.RbEuCfTf,S iSzIeMsP[LNEC,C LP_rPoRdO,T Oi_nStI3M2P_LtE)] / N| C^C L_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:S391/:s95i:z enote: oexpanded from macro 'IMPL_COLL_FUNC'f (T)) { 391 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R u| n group(groupW ork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Func## d626e | v r e d o p < t ypprei>m,s (NtCiCdL-_tAiLdGSOt_a#r#taSlcgaot,t eNrC,C Ln_TPhRrOeTaOd_s#S#cpartotteor>,( )N.UrLuLn,( &dnicrcelcSth-m>eump.,w oarrkg)s;- >\s e n| d ^b uff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562r:g15s:- >note: rfield 'nthreads' will be initialized after field 'tidInBlock'e cvbuf f562, | | ^ tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds(n t202h | r e a d s ) , tRiudnIWnoBrlkoEclke(mtehnrte ^~~~~~~~~~~~~~~~~( )./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(60w:e )note: ;field 'group' will be initialized after field 'stepSize' | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppi:d8(:t1i:d )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here nthr e8a | dIsM(PnLt_hCrOeLaLd_sF)U,N Ct(iAdlIlnRBeldouccke(,t hCrOeLaLdNIEdTx_.DxI)R,E CgTr,o uSpI(MgPrLoEu,p )P,r o d| , ^~~~~~~~~~~ int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ C(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'group' will be initialized after field 'stepSize': 15: warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pgroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), 563| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p60S:i znote: efield 'group' will be initialized after field 'stepSize'( ncclS h562m | e m . c otmimd.(btuifdf)S,i znetsh[rNeCaCdLs_(PnRtOhTrOe_aSdIsM)P,L Et]i/dNICnCBLl_oScTkE(PtSh/rseiazdeIodfx(.Tx))), {g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ( g| r group(groupo up), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~: 666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:s562e:n15d:b uwarning: finitializer order does not match the declaration order [-Wreorder-ctor]f , args->r e562c | v b u f ft,i d (| t ^i d), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202s | ) , t i d I n BRluoncWko(rtkhErleeamdeIndtx<.Fxn),, Tg,r oRuepd(Ogpr,o uApl)g,o , | P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o t| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)> ().ru n563( | w e ) ; s t| e ^p Size(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppn:c8c:l1S:h mnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herem .com m8. | bIuMfPfLS_iCzOeLsL[_NFCUCNLC_(PARlOlTROe_dSuIcMeP,L EC]O/LNLCNCELT__SDTIERPESC/Ts,i zSeIoMfP(LTE),) P{r o d| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ i n| t group(group6 4_t) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :626:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 95: note: expanded from macro 'IMPL_COLL_FUNC' 626 | 391 | RpurniWmosr(kt ,d iNrCeCcLt_-A>LuGpO,_ #a#raglsg-o>,s eNnCdCbLu_fPfR,O TaOr_g#s#-p>rroetcov>b(u)f.fr,u n (| & ^n cclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:e202m:.53w:o rnote: kin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) ; \ 202| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :R15u:n Wnote: ofield 'nthreads' will be initialized after field 'tidInBlock'r kElem e562n | t < F n ,t iTd,( tRiedd)O,p ,n tAhlrgeoa,d sP(rnotthor>e(a)d.sr)u,n (twied)I;n B l| o ^c k(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppa:d7I:d1x:. xnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, gro u7p | (IgMrPoLu_pC)O,L L _| F ^~~~~~~~~~~~~~~~~U NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:A562l:l60R:e dnote: ufield 'group' will be initialized after field 'stepSize'c e, CO L562L | N E T _ DtIiRdE(CtTi,d )S,I MnPtLhEr,e aPdrso(dn,t hurienatd3s2)_,t )t i d| I^n Bloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:(391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC'I dx.x), 391g | r o uRpu(ngWroorukp<)n,c c l| F ^~~~~~~~~~~u nc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t idI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o60c:k (note: tfield 'group' will be initialized after field 'stepSize'h read I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads) ,563 | t i d I nsBtleopcSki(zteh(rnecacdlISdhxm.exm).,c ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~L _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsSc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreaatdtse)r,, tNiUdLILn,B ldoicrke(ctth-r>euapd,I daxr.gxs)-,> sgernodubpu(fgfr,o uapr)g,s - >| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e c v| b tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u ff, | ^563 | stepSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:h202m:e53m:. cnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem m.buff S202i | z e s [ N C C L _RPuRnOWToOr_kSEIlMePmLeEn]t/ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( ) .| r group(groupu n(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppnote: :in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here9 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here641 | 9 | I M P Lp_rCiOmLsL(_tFiUdN-Ct(iAdlSltRaerdtuRceed,u cCeO,L LnNTEhTr_eDaIdRsERCeTd,u cSeI,M PdLiEr,e cPtr-o>dd,o wuni,n t&6d4i_rte)c t -| >^o ut, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g391s:-95>:s enote: nexpanded from macro 'IMPL_COLL_FUNC'd buff, a391r | g s -R>urneWcovrbku, 2, 2>::run' requested hereF unc# #202d | e v r e d o p < tRyupneW>o,r kNEClCeLm_eAnLtG().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(ntedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | Rhreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655687 | | pprriimmss((ttiidd--ttiiddSSttaarrttRBecdauscte,, nnTThhrreeaaddssBRceadsutc,e ,& dniurlelcptt-r>,o u&td,i rneucltl-p>toru,t ,a ragrsg-s>-s>esnednbdubfuff,f ,a rgasr-g>sr-e>crvebcuvfbfu,f f ,| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202::20253::53 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R uRnuWnoWrokrEklEelmeemnetno(>)(.)r.urnu(nw(ew)e;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp::77::11:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 77 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, PPrroodd,, uuiinntt3322__tt)) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<>,, NNCCCCLL__AALLGGOO__####aallggoo,, NNCCCCLL__PPRROOTTOO__####pprroottoo>>(())..rruunn((&&nnccccllSShhmmeemm..wwoorrkk));; \\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement (t)i.dr(utni(dw)e,) ;n t h| r ^e ads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:h8r:e1a:d snote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, tid I8n | BIlMoPcLk_(CtOhLrLe_aFdUINdCx(.Axl)l,R egdruocuep,( gCrOoLuLpN)E,T _ D| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R E C| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), SIMP L563E | , P r osdt,e piSnitz6e4(_ntcclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P RtOiTdOI_n#B#lporcokt(ot>h(r)e.arduInd(x&.nxc)c,l Sghrmoeump.(wgorroku)p;) ,\ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56315 | : note: field 'nthreads' will be initialized after field 'tidInBlock' stepS i562z | e ( n c ctliSdh(mteimd.)c,o mnmt.hbruefafdSsi(znetsh[rNeCaCdLs_)P,R OtTiOd_ISnIBMlPoLcEk](/tNhCrCeLa_dSITdExP.Sx/)s,i zgeroofu(pT()g)r o{u p )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 687note: :field 'group' will be initialized after field 'stepSize'11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 687 | t i d ( t i d ) , nptrhirmesa(dtsi(dn-tthirdeSatdasr)t,B ctaisdtI,n BnlTohcrke(atdhsrBecaadsItd,x .&xd)i,r egcrto-u>po(ugtr,o unpu)l,l p t| r ^~~~~~~~~~~, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562I:M15P:L Ewarning: ]initializer order does not match the declaration order [-Wreorder-ctor]/ NCCL_STEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,626 :t9i:d Inote: nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereB lock(t h626r | e a d I d x . x )p,r igmrso(utpi(dg-rtoiudpS)t,a r t| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c a t| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e r, n T563h | r e a d ssStceaptStiezre,( nNcUcLlLS,h mdeimr.eccotm-m>.ubpu,f faSrigzse-s>[sNeCnCdLb_uPfRfO,T Oa_rSgIsM-P>LrEe]c/vNbCuCfLf_,S T E| P ^S /sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:(202T:)53): {note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 202 group(group | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562 | : 562 :t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthreads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d I| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x .x), g r563o | u p ( g rsotuepp)S,i z e| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n c c| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S hmem .563c | o m m . bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| S group(groupT EPS/sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :{655 : 11| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| group(group 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677p:r11i:m snote: (in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id-t i677d | S t a r t R e d u c ep,r inmTsh(rteiadd-stRieddSutcaer,t Bncualsltp,t rn,T h&rdeiardescBtc-a>sotu,t ,& dairrgesc-t>-s>eonudtb,u fdfi,r eacrtg-s>-d>orwenc,v baurfgfs,- > s| e ^n dbuff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:s202-:>53r:e cnote: vin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereb uff, 202 | | ^ R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202W:o53r:k Enote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ment <202F | n , T , R e dROupn,W oArlkgEol,e mPernottn(,) .Tr,u nR(ewdeO)p;, A| l ^g o, Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppo:>9(:)1.:r unote: nin instantiation of member function 'RunWork, 2, 2>::run' requested here( we); 9 | | I ^M PL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppL:_9F:U1N:C (note: Ain instantiation of member function 'RunWork, 2, 2>::run' requested herel lRe d9u | cIeM,P LC_OCLOLLNLE_TF_UDNICR(EAClTl,R eSdIuMcPeL,E ,C OPLrLoNdE,T _uDiInRtE6C4T_,t )S I M| P^L E, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:,391 :u95i:n tnote: 6expanded from macro 'IMPL_COLL_FUNC'4 _t) | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391u:n95W:o rnote: kexpanded from macro 'IMPL_COLL_FUNC'< nccl F391u | n c #R#ufnuWnocr,k c,# #NdCeCvLr_eAdLoGpO<_t#y#pael>g,o ,N CNCCLC_LA_LPGROO_T#O#_a#l#gpor,o tNoC>C(L)_.PrRuOnT(O&_n#c#cplrSohtmoe>m(.)w.orrukn)(;& n\c c l| S ^h mem.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):;562 :\15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(ti d562) | , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~g roup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 60 ^~~~~~~~~~~~~~~~~: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'group' will be initialized after field 'stepSize': 562:60: note: field 'group' will be initialized after field 'stepSize'562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:)15;: \warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g roup )563, | | ^~~~~~~~~~~~~~~~~ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60(:n cnote: cfield 'group' will be initialized after field 'stepSize'l Shmem .562c | o m m . btuifdf(Stiizde)s,[ NnCtChLr_ePaRdOsT(On_tShIrMePaLdEs])/,N CtCiLd_ISnTBElPoSc/ks(itzheroefa(dTI)d)x .{x ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp (group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641 :| 11 ^~~~~~~~~~~: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562u:i15n:t 6warning: 4initializer order does not match the declaration order [-Wreorder-ctor]_ t) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), n t391h | r e aRdusn(Wnotrhkrp,( gNrCoCuLp_)A,L G O| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~# # a| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g o, NC C563L | _ P R O TsOt_e#p#Spirzoet(on>c(c)l.Srhumne(m&.nccocmlmS.hbmuefmf.Swiozreks)[;N C\C L _| P ^R OTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15]:/ Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_STEP S562/ | s i z e otfi(dT()t)i d{) , | n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d655s:)11,: tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered InBlo c655k | ( t h r e a d I d x .pxr)i,m sg(rtoiudp-(tgirdoSutpa)r,t R e| d ^~~~~~~~~~~~~~~~~u ce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:T562h:r60e:a dnote: sfield 'group' will be initialized after field 'stepSize'R educe ,562 | n u l l pttird,( t&iddi)r,e cntt-h>roeuatd,s (anrtghsr-e>asdesn)d,b utfifd,I naBrlgosc-k>(rtehcrvebaudfIfd,x . x| ) ^, group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)P roto> (563) | . r u n (swtee)p;S i z| e ^( ncclShmem.comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.:b8u:f1f:S inote: zin instantiation of member function 'RunWork, 2, 2>::run' requested heree s[NCC L8_ | PIRMOPTLO__CSOILMLP_LFEU]N/CN(CAClLl_RSeTdEuPcSe/,s iCzOeLoLfN(ETT)_)D I{R E C| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, S| I group(groupM PLE, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:,666 :i9n:t 6note: 4in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ t) | ^666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :p95r:i mnote: sexpanded from macro 'IMPL_COLL_FUNC'( tid, n T391h | r e aRdusnGWaotrhkefuupn,c ,N UtLyLp,e ,a rFgusn-c>#s#ednedvbruefdfo,p >,r eNcCvCbLu_fAfL,G O _| # ^# algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202C:L53_:P Rnote: Oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT O_## p202r | o t o > ( ) . r uRnu(n&WnocrcklESlhemmeemn.twfield 'nthreads' will be initialized after field 'tidInBlock'( ).run( w562e | ) ; | t ^i d(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :n9t:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres (nth r9e | aIdMsP)L,_ CtOiLdLI_nFBUlNoCc(kA(ltlhRreedaudcIed,x .CxO)L,L NgErTo_uDpI(RgErCoTu,p )S,I M P| L ^~~~~~~~~~~~~~~~~E , Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:d562,: 60u:i nnote: tfield 'group' will be initialized after field 'stepSize'6 4_t) 562| | ^ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthread s391( | n t hRruenaWdosr)k,< ntcicdlIFnuBnlco#c#kf(utnhcr,e atdyIpdex,. xF)u,n cg#r#oduepv(rgerdooupp<)t,y p e| > ^~~~~~~~~~~, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R unWo563r | k< n c c lsFtuenpcS#i#zfeu(nncc,c ltSyhpmee,m .Fcuonmcm#.#bduefvfrSeidzoeps<[tNyCpCeL>_,P RNOCTCOL__SAILMGPOL_E#]#/aNlCgCoL,_ SNTCECPLS_/PsRiOzTeOo_f#(#Tp)r)o t{o > (| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. r u| n group(group( &ncclShmem.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h\: 687 :| 11 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :68715 | : note: field 'nthreads' will be initialized after field 'tidInBlock' p562r | i m s ( ttiidd-(ttiiddS)t,a rnttBhcraesatd,s (nnTthhrreeaaddssB)c,a stti,d I&ndBilroecckt(-t>horueta,d Induxl.lxp)t,r ,g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h r:oa562ur:pg15(sg:-r >owarning: uspinitializer order does not match the declaration order [-Wreorder-ctor]e) ,n d b| u ^~~~~~~~~~~~~~~~~f f ,562 | a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h r: g562 s: -60t:> inote: rdfield 'group' will be initialized after field 'stepSize'e( ctvibdu)f, f 562, | n t h| r ^tei da(dtsi(dn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h)t:,h 202rn:et53ah:dr senote: )ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered,s ( ntti hd202rIe | na Bd ls o) c, k (t it dhRIrunenBaWldooIcrdkk(xEt.hlrxee)amd,Ied nxgt.rx | ( ) . r usnt(ewpeS)i;z e (| n ^c clShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppm:.9c:o1m:m .note: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ffSi z9e | sI[MNPCLC_LC_OPLRLO_TFOU_NSCI(MAPlLlER]e/dNuCcCeL,_ SCTOELPLSN/EsTi_zDeIoRfE(CTT),) S{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E , | P group(groupr od, uint64_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:)687 : 11| :^ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :687 | note: expanded from macro 'IMPL_COLL_FUNC' 391 | p r iRmusn(Wtoirdk-yopuet>,, nNuClClLp_tArL,G Oa_r#g#sa-l>gsoe,n dNbCuCfLf_,P RaOrTgOs_-#>#rpercovtbou>f(f),. r u| n ^( &ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m202e:m53.:w onote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek ); \ 202 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:u562n:W15o:r knote: Efield 'nthreads' will be initialized after field 'tidInBlock'l ement< F562n | , T , tRiedd(Otpi,d )A,l gnot,h rPeraodtso(>n(t)h.rreuand(sw)e,) ;t i d| I ^n Block(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:h9r:e1a:d Inote: din instantiation of member function 'RunWork, 2, 2>::run' requested herex .x), 9g | rIoMuPpL(_gCrOoLuLp_)F,U N C| ( ^~~~~~~~~~~~~~~~~A ll/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:e562d:u60c:e ,note: field 'group' will be initialized after field 'stepSize'C OLLNE T562_ | D I R E CtTi,d (StIiMdP)L,E ,n tPhrroeda,d su(inntth6r4e_atd)s ) ,| ^t idIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l391o:c95k:( tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eadIdx .391x | ) , RgurnoWuopr(kg, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]n Block(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~r eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidSta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 15field 'group' will be initialized after field 'stepSize': warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| :^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t i dR(utniWdo)r,k ),, NgCrCoLu_pA(LgGrOo_u#p#)a,l g o| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _PROT O563_ | # # p r osttoe>p(S)i.zreu(nn(c&cnlcSchlmSehmm.ecmo.mwmo.rbku)f;f S\i z e| s ^[ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O15T:O _note: Sfield 'nthreads' will be initialized after field 'tidInBlock'I MPLE]/ N562C | C L _ S TtEiPdS(/tsiidz)e,o fn(tTh)r)e a{d s (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:c666k:(9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea dIdx .666x | ) , g r o u p (pgrriomusp()t,i d ,| ^~~~~~~~~~~~~~~~~n Thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60G:a tnote: hfield 'group' will be initialized after field 'stepSize'e r, di r562e | c t - > utpi,d (NtUiLdL),, anrtghsr-e>asdesn(dnbtuhfrfe,a dasr)g,s -t>irdeIcnvBbluofcfk,( t h| r ^e adIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:x202):,53 :g rnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu p(gr o202u | p ) , | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rtScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~15 : | warning: group(groupinitializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562: | 655 : 11 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid), n655t | h r e a d s ( n t h rperaidmss)(,t itdi-dtIindBSltaock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~ cclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rtReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15P:R Owarning: Tinitializer order does not match the declaration order [-Wreorder-ctor]O _SIMPLE]/ N562C | C L _ S TtEiPdS(/tsiidz)e,o fn(tTh)r)e a{d s (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l666o:c9k:( tnote: hin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadIdx .666x | ) , g r o u p (pgrriomusp()t,i d ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n T h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e adsG a563t | h e r , sdtierpeScitz-e>(unpc,c lNSUhLmLe,m .acrogmsm-.>bsuefnfdSbiuzfefs,[ NaCrCgLs_-P>RrOeTcOv_bSuIfMfP,L E ]| / ^N CCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/:s202i:z53e:o fnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT )) { 202| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:l677e:m11e:n tnote: , FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF n, T, 677R | e d O p , A l g o ,p rPirmost(ot>i(d)-.triudnS(twaer); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), 563| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s tepSiz e563( | n c c l SshtmeepmS.iczoem(mn.cbculfSfhSmiezme.sc[oNmCmC.Lb_uPfRfOSTiOz_eSsI[MNPCLCEL]_/PNRCOCTLO__SSTIEMPPSL/Es]i/zNeCoCfL(_TS)T)E P{S / s| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e o| f group(group( T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677 :68711 | : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here p r677i | m s ( t i d - t i d SptrairmtsB(ctaisdt-,t indTShtraeratdBscBacsats,t ,n T&hdrieraedcstB-c>aosutt,, &nduilrlepcttr-,> oaurtg,s -d>isreencdtb-u>fdfo,w na,r gasr-g>sr-e>csvebnudfbfu,f f ,| ^a rgs->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:v202b:u53f:f ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202R:u53n:W onote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek Eleme n202t | < F n , T , RReudnOWpo,r kAEllgeom,e nPtr (T),. rRuend(Owpe,) ;A l g| o ^, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:o10>:(1):. rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren (w e10) | ;I M P| L ^_ COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:A8l:l1R:e dnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herec e, CO L8L | NIEMTP_LD_ICROELCLT_,F USNICM(PALlEl,R ePdruocde,, hCaOlLfL)N E T| _^D IREC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:,391 :S95I:M Pnote: Lexpanded from macro 'IMPL_COLL_FUNC'E , Pro d391, | i nRtu6n4W_otr)k < n| c^c lFunc#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:f391u:n95c:, note: texpanded from macro 'IMPL_COLL_FUNC'y pe, Func #391# | d e vRruendWooprl,F uNnCcC#L#_fAuLnGcO,_ #t#yapleg,o ,F uNnCcC#L#_dPeRvOrTeOd_o#p#>,( )N.CrCuLn_(A&LnGcOc_l#S#hamlegmo.,w oNrCkC)L;_ P\R O T| O ^_ ##pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:o562>:(15):. rnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n (&nc c562l | S h m e mt.iwdo(rtki)d;) ,\ n t| h ^r eads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s )note: ,field 'nthreads' will be initialized after field 'tidInBlock' tidInBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~~~~~~~, ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B60l:o note: cfield 'group' will be initialized after field 'stepSize'k (thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t60i:d Inote: nfield 'group' will be initialized after field 'stepSize'B lock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~e ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkd,) ,N CnCtLh_rAeLaGdOs_(#n#tahlrgeoa,d sN)C,C Lt_iPdRIOnTBOl_o#c#kp(rtohtroe>a(d)I.drxu.nx()&,n cgcrloSuhpm(egmr.owuopr)k,) ; | \ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h563: | 562 : 15 : snote: tfield 'nthreads' will be initialized after field 'tidInBlock'e pSize (562n | c c l S htmiedm(.tciodm)m,. bnutfhfrSeiazdess([nNtChCrLe_aPdRsO)T,O _tSiIdMIPnLBEl]o/cNkC(CtLh_rSeTaEdPISd/xs.ixz)e,o fg(rTo)u)p ({g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ) ,| group(group | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562::68760::11 :note: field 'group' will be initialized after field 'stepSize'note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 687 | t i d ( t i d )p,r inmtsh(rteiadd-st(indtShtraeratdBsc)a,s tt,i dnITnhBrleoacdks(Btcharseta,d I&ddxi.rxe)c,t -g>roouutp,( gnruolulpp)t,r , | a ^~~~~~~~~~~r gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 : Rwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]n WorkElement((n)t.hrruena(dwse)),; t i| d ^I nBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:t10h:r1e:a dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hered x.x) ,10 | gIrMoPuLp_(CgOrLoLu_pF)U,N C (| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l l R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d uce, C563O | L L N E Ts_tDeIpRSEiCzTe,( nScIcMlPSLhEm,e mP.rcoodm,m .hbaulfff)S i z| e^s [NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391P:R95O:T Onote: _expanded from macro 'IMPL_COLL_FUNC'S IMPLE] /391N | C C LR_uSnTWEoPrSk/in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, NCCL_ A626L | G O _ # # a l g op,r iNmCsC(Lt_iPdR-OtTiOd_S#t#aprrtoStcoa>t(t)e.rr,u nn(T&hnrcecaldSshSmceamt.tweorr,k )N;U L\L , | d ^i rect-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:u562p:,15 :a rnote: gfield 'nthreads' will be initialized after field 'tidInBlock's ->sen d562b | u f f , tairdg(st-i>dr)e,c vnbtuhfrfe,a d s| ( ^n thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Bloc k202( | t hrea d I d x . x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~~~~~~~T , Re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:,60 :A lnote: gfield 'group' will be initialized after field 'stepSize'o , Pro t562o | > ( ) . rtuind((wtei)d;) , | n ^t hreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:h9r:e1a:d snote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, tid I9n | BIlMoPcLk_(CtOhLrLe_aFdUINdCx(.Axl)l,R egdruocuep,( gCrOoLuLpN)E,T _ D| I ^~~~~~~~~~~R ECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcas/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562s:e15n:d bwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]f f, args -562> | r e c v btuifdf(,t i d| ) ^, nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s), t i202d | I n B l o c k ( tRhurneWaodrIkdExl.exm)e,n tg | ( ) . r usnt(ewpeS)i;z e (| n ^c clShmem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppo:m11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:s562e:n15d:b uwarning: finitializer order does not match the declaration order [-Wreorder-ctor]f , args-> r562e | c v b u ftfi,d ( t| i ^d ), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:(53n:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ads), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o , | P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oto> (563) | . r u n (swtee)p;S i z| e ^( ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppe:m10.:c1o:m mnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested hereb uffS i10z | eIsM[PNLC_CCLO_LPLR_OFTUON_CS(IAMlPlLREe]d/uNcCeC,L _CSOTLELPNSE/Ts_iDzIeRoEfC(TT,) )S I{M P L| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, P| r group(groupo d, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h626:391::959:: note: note: expanded from macro 'IMPL_COLL_FUNC'in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391626 | | R u n W porrikmL,L ,N CdCiLr_eAcLtG-O>_u#p#,a lagrog,s -N>CsCeLn_dPbRuOfTfO,_ #a#rpgrso-t>or>e(c)v.bruufnf(,& n c| c ^l Shmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hw:o202r:k53):; note: \in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 :R unote: nfield 'nthreads' will be initialized after field 'tidInBlock'W orkEl e562m | e n t < Ftni,d (Tt,i dR)e,d Onpt,h rAelagdos,( nPtrhorteoa>d(s)).,r utni(dwIen)B;l o c| k ^( threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:x11.:x1):, note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup (11g | rIoMuPpL)_,C O L| L ^~~~~~~~~~~~~~~~~_ FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:(562A:l60l:R enote: dfield 'group' will be initialized after field 'stepSize'u ce, C562O | L L N E Tt_iDdI(RtEiCdT),, SnItMhPrLeEa,d sP(rnotdh,r efaldosa)t,) t i| d^I nBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k391(:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Idx.x )391, | g rRouunpW(ogrrko, 562N | C C L _ AtLiGdO(_t#i#da)l,g on,t hNrCeCaLd_sP(RnOtThOr_e#a#dpsr)o,t ot>i(d)I.nrBulno(c&kn(ctchlrSehamdeImd.xw.oxr)k,) ;g r\o u p| ( ^g roup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 15 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | field 'nthreads' will be initialized after field 'tidInBlock' tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562563 | | tsitde(ptSiidz)e,( nnctchlrSehamdesm(.nctohmrme.abdusf)f,S itziedsI[nNBClCoLc_kP(RtOhTrOe_aSdIIMdPxL.Ex])/,N CgCrLo_uSpT(EgPrSo/uspi)z,e o f| ( ^~~~~~~~~~~~~~~~~T )) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'group' will be initialized after field 'stepSize' group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655 : 11t:i dnote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id), n655t | h r e a d s ( n t h rperaidmss)(,t itdi-dtIindBSltoacrkt(Rtehdruecaed,I dnxT.hxr)e,a dgsrRoeudpu(cger,o unpu)l,l p t| r ^~~~~~~~~~~, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here562 | 202 | t i d ( t i d ) ,R unntWhorrekaEdlse(mnetnhtrx(.)x.)r,u ng(rwoeu)p;( g r| o ^u p), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~11 : 1| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 56311 | | I M P Ls_tCeOpLSLi_zFeU(NnCc(cAllSlhRmeedmu.cceo,m mC.ObLuLfNfESTi_zDeIsR[ENCCTC,L _SPIRMOPTLOE_,S IPMrPoLdE,] /fNlCoCaLt_)S T E| P^S /siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:391o:f95(:T )note: )expanded from macro 'IMPL_COLL_FUNC' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 | | group(group RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #func ,655 | t y p e , F u n c #p#rdiemvsr(etdiodp-a,r tNRCeCdLu_cAeL,G On_T#h#raelagdos,R eNdCuCcLe_,P RnOuTlOl_p#t#rp,r o&tdoi>r(e)c.tr-u>no(u&tn,c calrSghsm-e>ms.ewnodrbku)f;f ,\ a r| g ^s ->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hv:b562u:f15f:, note: field 'nthreads' will be initialized after field 'tidInBlock'| ^ 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202(:t53i:d )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here nthre a202d | s ( n t h r e a dRsu)n,W otrikdEIlneBmleonctk<(Ftnh,r eTa,d IRdexd.Oxp),, Aglrgoou,p (Pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~n (we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):;562 : 60| : ^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp562: | 10 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid) ,10 | nItMhPrLe_aCdOsL(Ln_tFhUrNeCa(dAsl)l,R etdiudcIen,B lCoOcLkL(NtEhTr_eDaIdRIEdCxT.,x )S,I MgPrLoEu,p (Pgrroodu,p )h,a l f| ) ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562t:B15c:a swarning: tinitializer order does not match the declaration order [-Wreorder-ctor], nThreads B562c | a s t , t&iddi(rteicdt)-,> onutth,r enaudlsl(pnttrh,r eaardgss)-,> steinddIbnuBflfo,c ka(rtghsr-e>ardeIcdvxb.uxf)f,, g r| o ^u p(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 : 53| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | s tReupnSWiozrek(EnlcecmleSnhtmT(O)_.SrIuMnP(LwEe])/;N C C| L ^_ STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp/:s9i:z1e:o fnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested hereT )) {9 | I| M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P L _| C group(groupO LL_FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:l687R:e11d:u cnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, COLLN E687T | _ D I R E C T , S IpMrPiLmEs,( tPirdo-dt,i duSitnatr6t4B_cta)s t ,| ^n Thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391s:B95c:a snote: texpanded from macro 'IMPL_COLL_FUNC', &dir e391c | t - >RouuntW,o rnkunsce,n dtbyupfef,, Faurngcs#-#>dreevcrvebduofpf<,t y p| e ^> , NCCL_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:#202#:a53l:g onote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here NCCL_ P202 | R O T O _ # #RpurnoWtoor>k(E)l.ermuenn(t&(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):.562r:u15n:( wnote: efield 'nthreads' will be initialized after field 'tidInBlock') ; | ^562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:i9d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh reads (9n | tIhMrPeLa_dCsO)L,L _tFiUdNICn(BAllolcRke(dtuhcree,a dCIOdLxL.NxE)T,_ DgIrRoEuCpT(,g rSoIuMpP)L,E , | P ^~~~~~~~~~~~~~~~~r od,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n60t:6 4note: _field 'group' will be initialized after field 'stepSize't ) | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95(:t inote: dexpanded from macro 'IMPL_COLL_FUNC') , nthrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~ NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t id), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(gr o563u | p ) , s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e p S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)z e(nc c563l | S h m e ms.tceopmSmi.zbeu(fnfcScilzSehsm[eNmC.CcLo_mPmR.ObTuOf_fSSIiMzPeLsE[]N/CNCCLC_LP_RSOTTEOP_SS/IsMiPzLeEo]f/(NTC)C)L _{S T E| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S / s| i group(groupz eof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~666 : 9| : group(group note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h666: | 666 : 9 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pri m666s | ( t i d , n T hprreiamdss(Gtaitdh,e rn,T hdrieraedcstG-a>tuhpe,r ,N UdLiLr,e catr-g>su-p>,s eNnUdLbLu,f fa,r gasr-g>ss-e>nrdebcuvfbfu,f fa,r g s| - ^> recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :202 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R u n RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t id/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hrea d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads), t563i | d I n B lsotcekp(Stihzree(andcIcdlxS.hxm)e,m .gcroomump.(bgurfofuSpi)z,e s [| N ^~~~~~~~~~~C CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ndbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562c:t15-:> dwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]w n, &direc t562- | > o u t ,t iadr(gtsi-d>)s,e nndtbhurfefa,d sa(rngtsh-r>eraedcsv)b,u ftfi,d I n| B ^l ock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I202d:x53.:x )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here grou p202( | g r o u p ) , R| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n W o| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k Eleme n563t | < F n , sTt,e pRSeidzOep(,n cAcllgSoh,m ePmr.octoom>m(.)b.urfufnS(iwzee)s;[ N C| C ^L _PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppS:I9M:P1L:E ]note: /in instantiation of member function 'RunWork, 2, 2>::run' requested hereN CCL_ S9T | EIPMSP/Ls_iCzOeLoLf_(FTU)N)C ({A l l| R ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e d u| c group(groupe , COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:D687I:R11E:C Tnote: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here SIMPLE ,687 | P r o d , u i n t 6p4r_itm)s ( t| i^d -tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:t391a:r95t:B cnote: aexpanded from macro 'IMPL_COLL_FUNC's t, nTh r391e | a d sRBucnaWsotr,k <&ndcicrleFcutn-c>#o#uftu,n cn,u ltlyppter,, Faurngcs#-#>dseevnrdebduofpf<,t yapreg>s,- >NrCeCcLv_bAuLfGfO,_ # #| a ^l go, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here# #pro t202o | > ( ) . r u n ( &RnucncWloSrhkmEelme.mweonrtk<)F;n ,\ T ,| ^R edOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :A562l:g15o:, note: Pfield 'nthreads' will be initialized after field 'tidInBlock'r oto>( )562. | r u n ( wtei)d;( t i| d ^) , nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppa:d9s:(1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads) ,9 | tIiMdPILn_BClOoLcLk_(FtUhNrCe(aAdlIldRxe.dxu)c,e ,g rCoOuLpL(NgErTo_uDpI)R,E C T| , ^~~~~~~~~~~~~~~~~ SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:,60 :P rnote: ofield 'group' will be initialized after field 'stepSize'd , uin t5626 | 4 _ t ) t i| d^( tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391n:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd s(nthr e391a | d s )R,u ntWiodrIkn, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a dIdx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here s->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N C562C | L _ A L GtOi_d#(#taildg)o,, nNtChCrLe_aPdRsO(TnOt_h#r#epardost)o,> (t)i.drIunnB(l&oncckc(ltShhrmeeamd.Iwdoxr.kx));, \g r o| u ^p (gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,15 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~field 'nthreads' will be initialized after field 'tidInBlock' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL)E,] /gNrCoCuLp_(SgTrEoPuSp/)s,i z e| o ^~~~~~~~~~~~~~~~~f (T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562{: 60 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' | group(group 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t666i:d9):, note: nin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hread s666( | n t h r e a d s )p,r itmisd(ItniBdl,o cnkT(htrheraedasdGIadtxh.exr),, dgirroeucpt(-g>ruopu,p )N,U L L| , ^~~~~~~~~~~ args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hads(n:t562h:r15e:a dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]) , tidInBlock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~z es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:ROTO_##562p:r15o:t owarning: >initializer order does not match the declaration order [-Wreorder-ctor]( ).run(&n c562c | l S h m etmi.dw(otrikd));, \n t h| r ^e ads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,note: field 'nthreads' will be initialized after field 'tidInBlock't idInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . b| u ^~~~~~~~~~~~~~~~~f fSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562s:[60N:C Cnote: Lfield 'group' will be initialized after field 'stepSize'_ PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e666a:d9I:d xnote: .in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex ), gr o666u | p ( g r o u p ) ,p r i| m ^~~~~~~~~~~s (tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d )N,C CnL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562o:t15o:> (warning: )initializer order does not match the declaration order [-Wreorder-ctor]. run(&ncc l562S | h m e m .twiodr(kt)i;d )\, n| t ^h reads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: )field 'nthreads' will be initialized after field 'tidInBlock', tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ^~~~~~~~~~~~~~~~~b uff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60s:[ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PRO T562O | _ S I M PtLiEd](/tNiCdC)L,_ SnTtEhPrSe/asdisz(enotfh(rTe)a)d s{) , | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d I| n group(groupB lock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a677d:I11d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , gro u677p | ( g r o u p ) , | p ^~~~~~~~~~~r ims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIM,P LNEU]L/LN,C CaLr_gSsT-E>PsSe/nsdibzuefoff,( Ta)r)g s{- > r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c v b| u group(groupf f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :687 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s ( t i dR-utniWdoSrtkaErlteBmceanstt<,F nn,T hTr,e aRdesdBOcpa,s tA,l g&od,i rPercott-o>>o(u)t.,r unnu(lwlep)t;r , | a ^r gs->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:b11u:f1f:, note: ain instantiation of member function 'RunWork, 2, 2>::run' requested herer gs-> r11e | cIvMbPuLf_fC,O L L| _ ^F UNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:R202e:d53u:c enote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here COLL N202E | T _ D I R E C T ,R uSnIWMoPrLkEE,l ePmreondt,< Ffnl,o aTt,) R e| d^O p, Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:o391,: 95P:r onote: texpanded from macro 'IMPL_COLL_FUNC'o >().run (391w | e ) ;R u n| W ^o rk, 2, 2>::run' requested heref unc, 12t | yIpMeP,L _FCuOnLcL#_#FdUeNvCr(eAdlolpR,, CNOCLCLLN_EATL_GDOI_R#E#CaTl,g oS,I MNPCLCEL,_ PPRrOoTdO,_ #d#opurbolteo)> ( )| .^r un(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c391c:l95S:h mnote: eexpanded from macro 'IMPL_COLL_FUNC'm .work) ;391 | \ R| u ^n Work(,n tNhCrCeLa_dAsL)G,O _t#i#daIlngBol,o cNkC(CtLh_rPeRaOdTIOd_x#.#xp)r,o tgor>o(u)p.(rgurno(u&pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r60k:) ;note: field 'group' will be initialized after field 'stepSize'\ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup), | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t60i:d (note: tfield 'group' will be initialized after field 'stepSize'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :g 13r| :o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)1u :p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested here563r | o u p ) ,13s | t Ie| Mp ^~~~~~~~~~~PS Li_zCeO(LnLc_cFlUSNhCm(eAml.lcRoemdmu.cbeu,f fCSOiLzLeNsE[TN_CDCILR_EPCRTO,T OS_ISMIPMLPEL,E ]P/rNoCdC,L _rScTcElP_Sb/fsliozaeto1f6()T ) )| ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : group(group391 :95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: 391note: | in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunW o666r | k < n c c l F u npcr#i#mfsu(ntci,d ,t ynpTeh,r eFaudnscG#a#tdheevrr,e ddoipr>u,p ,N CNCULL_LA,L GaOr_g#s#-a>lsgeon,d bNuCfCfL,_ PaRrOgTsO-_>#r#epcrvobtuof>f(,) . r| u ^n (&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m202e:m53.:w onote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek ); \202 | | ^ RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho::r562562k::E1515l::e mnote: warning: efield 'nthreads' will be initialized after field 'tidInBlock'initializer order does not match the declaration order [-Wreorder-ctor]n tea(ad)ds.s(r(nuntnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :expanded from macro 'IMPL_COLL_FUNC'562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor]391 | RunWork< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t o >| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) .run(& n563c | c l S h msetme.pwSoirzke)(;n c\c l S| h ^m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:m562m:.15b:u fnote: ffield 'nthreads' will be initialized after field 'tidInBlock'S izes [562N | C C L _ PtRiOdT(Ot_iSdI)M,P LnEt]h/rNeCaCdLs_(SnTtEhPrSe/asdisz)e,o ft(iTd)I)n B{l o c| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t h| r group(groupe adIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641| : ^~~~~~~~~~~~~~~~~11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: 641field 'group' will be initialized after field 'stepSize' | 562 | p rtiimds((ttiidd)-,t indtShtraeratdRse(dnutcher,e andTsh)r,e atdisdRIendBulcoec,k (dtihrreecatd-I>ddxo.wxn),, &gdrioruepc(tg-r>oouupt),, a r| g ^~~~~~~~~~~s ->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l15o:c kwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t hreadIdx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~e ads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~[ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree dop, N C626C | L _ A L G O _ # #parligmos,( tNiCdC-Lt_iPdRSOtTaOr_t#S#cpartotteor>,( )n.Trhurne(a&dnscScclaSthtmeerm,. wNoUrLkL),; d\i r e| c ^t ->up, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>15s:e nnote: dfield 'nthreads' will be initialized after field 'tidInBlock'b uff, a562r | g s - > rteicdv(btuifdf),, n| t ^h reads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53I:n Bnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo ck(th r202e | a d I d x . x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~~~~~~~ T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562O:p60,: Anote: lfield 'group' will be initialized after field 'stepSize'g o, Pr o562t | o > ( ) .triudn((twied));, n| t ^h reads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:h11r:e1a:d snote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, tid I11n | BIlMoPcLk_(CtOhLrLe_aFdUINdCx(.Axl)l,R egdruocuep,( gCrOoLuLpN)E,T _ D| I ^~~~~~~~~~~R ECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m15e:m .warning: winitializer order does not match the declaration order [-Wreorder-ctor]o rk); \ 562| | ^ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i15d:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'n threa d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x . x| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), grou p563( | g r o u ps)t,e p S| i ^~~~~~~~~~~~~~~~~z e(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S60h:m enote: mfield 'group' will be initialized after field 'stepSize'. comm. b562u | f f S i zteisd[(NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt]h/rNeCaCdLs_)S,T EtPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 666 ^~~~~~~~~~~: 9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ go, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:a562l:g15o:, warning: Ninitializer order does not match the declaration order [-Wreorder-ctor]C CL_PROT O562_ | # # p r ottiod>((t)i.dr)u,n (n&tnhcrcelaSdhsm(enmt.hwroerakd)s;) ,\ t i| d ^I nBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock'I dx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~~~~~~~_ SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:_15b:f lwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]a t16) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nt h391r | e a dRsu(nnWtohrrke),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ A| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)G O_##al g563o | , N C CsLt_ePpRSOiTzOe_(#n#cpcrloSthom>e(m)..croumnm(.&bnucfcflSSihzmeesm[.NwCoCrLk_)P;R O\T O _| S ^I MPLE]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15S:T Enote: Pfield 'nthreads' will be initialized after field 'tidInBlock'S /size o562f | ( T ) ) t{i d (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d )| , group(group nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r626e:a9d:s )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidInBl o626c | k ( t h r e a d Ipdrxi.mxs)(,t igdr-otuipd(SgtraorutpS)c,a t t| e ^~~~~~~~~~~~~~~~~r , n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:h562r:e60a:d snote: Sfield 'group' will be initialized after field 'stepSize'c atte r562, | N U L Lt,i dd(itriedc)t,- >nutph,r eaardgss(-n>tshendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp153: | 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :u10i: nIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h3:2168_: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :d153a:t14a:1 ,warning: unused variable 'data1' [-Wunused-variable]f lag1, d a153t | a 2 , fuliangt23;2 _ t| ^~~~~d ata/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h1:,153 :f35l:a gwarning: 1unused variable 'flag2' [-Wunused-variable], dat a1532 | , f l augi2n;t 3 2| _ ^~~~~t dat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ha:1153,: 21f:l awarning: gunused variable 'flag1' [-Wunused-variable]1 , da t153a | 2 , f luaign2t;3 2 | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ _t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:a15l:g owarning: ,initializer order does not match the declaration order [-Wreorder-ctor] NCCL_PROTO_# #562p | r o t o >t(i)d.(rtuind()&,n cnctlhSrhemaedms.(wnotrhk)r;e a\d s )| , ^ tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (note: tfield 'nthreads' will be initialized after field 'tidInBlock'h readI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s ( n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads )563, | t i d IsntBelpoScikz(et(hnrcecaldSIhdmxe.mx.)c,o mgmr.obuupf(fgSriozueps)[,N C C| L ^~~~~~~~~~~~~~~~~_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I60M:P Lnote: Efield 'group' will be initialized after field 'stepSize'] /NCCL _562S | T E P S /tsiidz(etoifd()T,) )n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:d34I:n7B:l onote: cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herek (thread I34d | x . x ) , gprroiumps((gtriodu,p )n,t h r| e ^~~~~~~~~~~a ds, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]x .x), grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n threa d563s | ( n t h rsetaedpsS)i,z et(indcIcnlBSlhomcekm(.tchormema.dbIudfxf.Sxi)z,e sg[rNoCuCpL(_gPrRoOuTpO)_,S I M| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L E ]| / tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N CCL_S T563E | P S / s iszteeopfS(iTz)e)( n{c c l| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h m e| m group(group. comm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hz:e34s:[7N:C Cnote: Lin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ PROTO_S I34M | P L E ] / N CpCrLi_mSsT(EtPiSd/,s inztehorfe(aTd)s), {& r i| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g - >| p group(groupr ev, &ri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hn:g34-:>7n:e xnote: tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, args->s e34n | d b u f f , parrigmss-(>triedc,v bnutfhfr,e aadrsg,s -&>rriendgO-p>Aprrge,v ,0 ,& rairnggs-->>nceoxntn,I nadregxs,- >asregnsd-b>ucfofn,n Ianrdgesx-)>;r e c| v ^b uff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :a80r:gs->red5O:p Anote: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereg , 0 ,80 | a r g s -r>ucnoRninnIgnPcroontnoI>n(daerxg)s;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::5202:: 53note: :in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 80 | 202 | r u n R i n g n(,a rTg,s )R;e d O| p ^, Algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :P202r:o53t:o >note: (in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) .run (202w | e ) ; | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cppr:k6E:l1e:m enote: nin instantiation of member function 'RunWork, 1, 2>::run' requested heret e(,) .RrIuNnG(,w eS)I;M P L| E ^, Max,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp :i5n:t13:2 _note: tin instantiation of member function 'RunWork, 1, 2>::run' requested here) | ^5 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:C391O:L95L:_ Fnote: Uexpanded from macro 'IMPL_COLL_FUNC'N C(Redu c391e | , RRIuNnGW,o rSkI ,391 | N C CRLu_nAWLoGrOk_<#n#cacllgFou,n cN#C#CfLu_nPcR,O TtOy_p#e#,p rFoutnoc>#(#)d.ervurne(d&onpce,m .NwCoCrLk_)A;L G\O _ #| # ^a lgo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C15L:_ Pnote: Rfield 'nthreads' will be initialized after field 'tidInBlock'O TO_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l15o:c knote: (field 'nthreads' will be initialized after field 'tidInBlock't hread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~( nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t id/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkEleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreadusf)f,S itziedsI[nNBClCoLc_kP(RtOhTrOe_aSdIIMdPxL.Ex])/,N CgCrLo_uSpT(EgPrS/siozuepo)f,( T )| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ { | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 563 | stepSize(ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.he:m34.:c7o:m mnote: .in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereb uffSizes [34N | C C L _ P R OpTrOi_mSsI(MtPiLdE,] /nNtChCrLe_aSdTsE,P S&/rsiinzge-o>fp(rTe)v), {& r i| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g - >| n group(groupe xt, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h-:>34s:e7n:d bnote: uin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref f, args -34> | r e c v b u fpfr,i masr(gtsi-d>,r endtOhprAeragd,s ,0 ,& rairnggs-->>pcroenvn,I n&dreixn,g -a>rngesx-t>,c oanrngIsn-d>esxe)n;d b u| f ^f , ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hg:s80-:>5r:e cnote: vin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereb uff ,80 | a r g s -r>urneRdiOnpgAoctoon>n(Ianrdgesx),; a r| g ^s ->conn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:n202d:e53x:) ;note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | ^ 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h | : 80 : 5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here Run W80o | r k E l ermuennRti,( aPrrgost)o;> ( )| . ^r un(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 : 53| : ^ note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp :2027 | : 1 : note: in instantiation of member function 'RunWork, 1, 2>::run' requested here Ru n7W | oIrMkPELl_eCmOeLnLt_ (M)a.xr,u nu(iwnet)3;2 _ t| ) ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h1::391 :note: 95in instantiation of member function 'RunWork, 1, 2>::run' requested here: note: expanded from macro 'IMPL_COLL_FUNC' 7 | IMPL _391C | O L LR_uFnUWNoCr(kR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391N:C95C:L _note: Aexpanded from macro 'IMPL_COLL_FUNC'L GO_##al g391o | , NRCuCnLW_oPrRkOf(u)n.cr,u nt(y&pnec,c lFSuhnmce#m#.dweovrrke)d;o p\< t y| p ^e >, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562A:L15G:O _note: #field 'nthreads' will be initialized after field 'tidInBlock'# algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrcecaldSsh)m,e mt.iwdoIrnkB)l;o c\k ( t| h ^r eadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~( tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n60t:h rnote: efield 'group' will be initialized after field 'stepSize'a ds(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x60):, note: gfield 'group' will be initialized after field 'stepSize'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 15g:r owarning: uinitializer order does not match the declaration order [-Wreorder-ctor]p (group), 562| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup( g563r | o u p ) ,s t e| p ^~~~~~~~~~~S ize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, r&eraidnIgd-x>.pxr)e,v ,g r&oruipn(gg-r>onuepx)t,, a| r ^~~~~~~~~~~g s->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement:( )warning: .initializer order does not match the declaration order [-Wreorder-ctor]r un(we); 562| | ^ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp(:t11i:d1):, note: nin instantiation of member function 'RunWork, 1, 2>::run' requested heret hrea d11s | (InMtPhLr_eCaOdLsL)_,F UtNiCd(IRneBdluoccek,( tRhIrNeGa,d ISdIxM.PxL)E,, gMraoxu,p (fglrooautp)) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95 :563 | note: expanded from macro 'IMPL_COLL_FUNC' step S391i | z e (RnucncWloSrhkmS,T ENPCSC/Ls_iAzLeGoOf_(#T#)a)l g{o , | N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C C L| _ group(groupP ROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ho:t34o:>7(:) .note: rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(&nccl S34h | m e m . w o rpkr)i;m s\( t i| d ^, nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15,: ¬e: rfield 'nthreads' will be initialized after field 'tidInBlock'i ng->p r562e | v , & rtiindg(->nteixdt),, anrtghsr-e>asdesn(dnbtuhfrfe,a dasr)g,s -t>irdeIcnvBbluofcfk,( tahrrgesa-d>Irdexd.Oxp)A,r gg,r o0u,p (agrrgosu-p>)c,o n n| I ^~~~~~~~~~~~~~~~~n dex,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562r:g60s:- >note: cfield 'group' will be initialized after field 'stepSize'o nnInd e562x | ) ; | t ^i d(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:)80,: 5n:t hnote: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heree ads (80n | t h r e ardusn)R,i ntgid(xa.rxg)s,) ;g r o| u ^p (group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^~~~~~~~~~~: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:n562n:I15n:d ewarning: xinitializer order does not match the declaration order [-Wreorder-ctor]) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h: 80562: | 5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heret id( t80i | d ) , nrtuhnrReiandgs<(Tn,t hRreedaOdps,) ,P rtoitdoI>n(Balrogcsk)(;t h r| e ^a dId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.202x:)53,: gnote: rin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereo up(g r202o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)W orkEle m563e | n t < F ns,t eTp,S iRzeed(Onpc,c lASlhgmoe,m .Pcroomtmo.>b(u)f.frSuinz(ewse[)N;C C L| _ ^P ROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp_:S13I:M1P:L Enote: ]in instantiation of member function 'RunWork, 1, 2>::run' requested here/ NCCL _13S | TIEMPPSL/_sCiOzLeLo_fF(UTN)C)( R{e d u| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e , | R group(groupI NG, SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h,: 34M:a7x:, note: rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec cl_bfloa t341 | 6 ) | ^ pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:s391(:t95i:d ,note: expanded from macro 'IMPL_COLL_FUNC'n threa d391s | , &RruinnWgo-r>kpunnecx,t ,t yapreg,s -F>usnecn#d#bduefvfr,e daorpgpree>c,v bNuCfCfL,_ AaLrGgOs_-#>#raeldgOop,A rNgC,C L0_,P RaOrTgOs_-#>#cpornontIon>d(e)x.,r uanr(g&sn-c>ccloSnhnmIenmd.ewxo)r;k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::5562:: 15note: :in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here note: field 'nthreads' will be initialized after field 'tidInBlock' 80 | 562 | r u n Rtiindg(t(harregasd)s;) , | t ^i dInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k202(:t53h:r enote: ain instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered Idx. x202) | , g r o u p ( gRruonuWpo)r,k E l| e ^~~~~~~~~~~~~~~~~m ent/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h<:F562n:,60 :T ,note: field 'group' will be initialized after field 'stepSize'R edOp, 562A | l g o , tPirdo(ttoi>d()),. rnutnh(rweea)d;s ( n| t ^h reads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp,: 12t:i1d:I nnote: Bin instantiation of member function 'RunWork, 1, 2>::run' requested herel ock (12t | hIrMePaLd_ICdOxL.Lx_)F,U NgCr(oRuepd(ugcreo,u pR)I,N G ,| ^~~~~~~~~~~S IMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h6:4514_:t9*: pwarning: tvariable 'offset' set but not used [-Wunused-but-set-variable]r = r514e | c v P t ri(n0t) +olflf1s2e8tO f=f steitd;; | | ^~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:W562o:r15k:E lwarning: einitializer order does not match the declaration order [-Wreorder-ctor]m entr(e)a.drsu(nn(twher)e;a d s| ) ^, tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cppn:B4l:o1c:k (note: tin instantiation of member function 'RunWork, 1, 2>::run' requested hereh rea d4I | dIxM.PxL)_,C OgLrLo_uFpU(NgCr(oRuepd)u,c e ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R I N| G tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), SIMPL E563, | S u m PsotsetpDSiivz,e (innctc8l_Sthmem.comm.buffSiz tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e s[NCCL_PROT O563_ | S I M P LsEt]e/pNSCiCzLe_(SnTcEcPlSS/hsmiezme.ocfo(mTm).)b u{f f S| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e s| [ group(groupN CCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h_:S34I:M7P:L Enote: ]in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/ NCCL_S T34E | P S / s i z eporfi(mTs)()t i{d , | n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds, &ring->prev, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:i34n:g7-:> nnote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex t, args- >34s | e n d b u f fp,r iamrsg(st-i>dr,e cnvtbhurfefa,d sa,r g&sr-i>nrge-d>OpprAervg,, &0r,i nagr-g>sn-e>xcto,n naIrngdse-x>,s eanrdgbsu-f>fc,o nanrIgnsd-e>xr)e;c v b| u ^f f, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:g80s:-5>:r enote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereO pA r80g | , 0 , raurngRsi-n>gc>c(oanrngIsn)d;e x )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h202:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:95: note: :expanded from macro 'IMPL_COLL_FUNC'562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | RunWork< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t o >| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) .run( &563n | c c l S hsmteemp.Swiozrek()n;c c\l S h| m ^e m.comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:b562u:f15f:S inote: zfield 'nthreads' will be initialized after field 'tidInBlock'e s[NC C562L | _ P R O TtOi_dS(ItMiPdL)E,] /nNtChCrLe_aSdTsE(PnSt/hsriezaedosf)(,T )t)i d{I n B| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o c k| ( group(groupt hreadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ho:u34p:)7,: note: | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :34562 | : 60 : note: field 'group' will be initialized after field 'stepSize' prim s562( | t i d , tnitdh(rteiadd)s,, n&trhirnega-d>sp(rev, n&trhirnega-d>sn)e,x tt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o uapr)g,s - >| r ^~~~~~~~~~~e dOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 13 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | intIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppu:p1(: gIn file included from r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u10p: )In file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 167| : ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::15562:: 60warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppm:s1(: tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:-10t: iIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hS:t167a: r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:B562c:a15s:t ,warning: initializer order does not match the declaration order [-Wreorder-ctor]n ThreadsBcas t562, | & d i rteicdt(-t>iodu)t,, nntuhlrlepatdrs,( natrhgrse-a>dsse)n,d btuifdfI,n Balrogcsk-(>trhercevabduIfdfx,. x )| , ^ group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 202 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | 563 | R u n WsotrekpESliezmee(nntcL(_)P.RrOuTnO(_wSeI)M;P L E| ] ^/ NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppE:P5S:/1s:i znote: ein instantiation of member function 'RunWork, 2, 2>::run' requested hereo f(T) )5 | {I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ C O| L group(groupL _FUNC(AllReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:,626 :C9O:L Lnote: Nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereE T_DIRE C626T | , S I M P L E ,p rMiimns,( tuiidn-tt8i_dtS)t a r| t^S catte/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:,391 :n95T:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a dsScat t391e | r , RNuUnLWLo,r kdnucp#,# faunc, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rgs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s (nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ( t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadId x563. | x ) , gsrtoeuppS(igzreo(unpc)c,l S h| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hock(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), group(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~( tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n60t:h rnote: efield 'group' will be initialized after field 'stepSize'a ds(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d I| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x .x), g563r | o u p ( gsrtoeuppS)i,z e (| n ^~~~~~~~~~~c clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hze(n:c562c:l15S:h mwarning: einitializer order does not match the declaration order [-Wreorder-ctor]m .comm.buff S562i | z e s [ NtCiCdL(_tPiRdO)T,O _nStIhMrPeLaEd]s/(NnCtChLr_eSaTdEsP)S,/ stiizdeIonfB(lTo)c)k ({t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d I| d group(groupx .x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:(687g:r11o:u pnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 687| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | p r ismtse(ptSiidz-et(indcSctlaSrhtmBecma.scto,m mn.TbhurfefaSdiszBecsa[sNtC,C L&_dPiRrOeTcOt_-S>IoMuPtL,E ]n/uNlClCpLt_rS,T EaPrSg/ss-i>zseeonfd(bTu)f)f ,{ a r| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s - >| r group(groupe cvbuff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :641:11:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 641 | 202 | pRruinmWso(rtkiEdl-etmiednStt (d)i.rreucnt(-w>ed)o;w n ,| ^& direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:-4>:o1u:t ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested herea rgs -4> | sIeMnPdLb_uCfOfL,L _aFrUgNsC-(>ArlelcRvebduufcfe,, C| O ^L LNET_DI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:E202C:T53,: Snote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereM PLE, M202i | n , i n t 8 _ tR)u n W| o^r kEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:m391e:n95t:< Fnote: nexpanded from macro 'IMPL_COLL_FUNC', T, R e391d | O p ,R uAnlWgoor,k u(n)c.#r#ufnu(nwce,) ;t y p| e ^, Func#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp#:d6e:v1r:e dnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested herep | ,I MNPCLC_LC_OALLLG_OF_U#N#Ca(lAglol,R eNdCuCcLe_,P RCOOTLOL_N#E#Tp_rDoItRoE>C(T),. rSun(&nIcMcPlLSEh,m eMmi.nw,o rikn)t;3 2\_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::15391:: 95note: :field 'nthreads' will be initialized after field 'tidInBlock' note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391t | i d (RtuindW)o,r kn,, gNrCoCuLp_(AgLrGoOu_p#)#,a l g| o ^~~~~~~~~~~~~~~~~, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60P:R Onote: Tfield 'group' will be initialized after field 'stepSize'O _## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k15(:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e adIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~a ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:W562o:r15k:< nwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]c lFunc##f u562n | c , t ytpied,( tFiudn)c,# #ndtehvrreeaddosp(a,d sN)C,C Lt_iAdLIGnOB_l#o#cakl(gtoh,r eNaCdCILd_xP.RxO)T,O _g#r#opurpo(tgor>o(u)p.)r,u n (| & ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n c c| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S hme m563. | w o r k )s;t e\p S i| z ^e (ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562c:o15m:m .note: bfield 'nthreads' will be initialized after field 'tidInBlock'u ffSi 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolcSkh(mtehmr.ewaodrIkd)x;. x\) , | g ^r oup(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 15 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: field 'nthreads' will be initialized after field 'tidInBlock' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t isdt(etpiSdi)z,e (nntchcrleSahdmse(mn.tchormema.dbsu)f,f StiizdeIsn[BNlCoCcLk_(PtRhOrTeOa_dSIIdMxP.LxE)],/ NgCrCoLu_pS(TgErPoSu/ps)i,z e o| f ^~~~~~~~~~~~~~~~~( T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'group' will be initialized after field 'stepSize' group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t641i:d11(:t inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , nthr e641a | d s ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteRaeddIudcxe.,x )n,T hgrreoaudps(Rgerdouucpe),, d i| r ^~~~~~~~~~~e ct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement562( | ) . r u nt(iwde()t;i d )| , ^ nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppa:d5s:(1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads )5, | ItMiPdLI_nCBOlLoLc_kF(UtNhCr(eAaldlIRdexd.uxc)e,, gCrOoLuLpN(EgTr_oDuIpR)E,C T ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S I M| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L E, M i563n | , u i nstt8e_ptS)i z e| (^n cclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m391e:m95.:c onote: mexpanded from macro 'IMPL_COLL_FUNC'm .buffS i391z | e s [RNuCnCWLo_rPkR group(group, NCCL_ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h#:#655a:l11g:o ,note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereN CCL_P R655O | T O _ # # p r o t o >p(r)i.mrsu(nt(i&dn-ctcildSShtmaermt.Rweodrukc)e;, \n T h| r ^e adsR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c15e:, note: nfield 'nthreads' will be initialized after field 'tidInBlock'u llpt r562, | & d i rteicdt(-t>iodu)t,, natrhgrse-a>dsse(nndtbhurfefa,d sa)r,g st-i>drIencBvlboucfkf(,t h r| e ^a dIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202g:r53o:u pnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg rou p202) | , | ^~~~~~~~~~~~~~~~~ R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562W:o60r:k Enote: lfield 'group' will be initialized after field 'stepSize'e ment <562F | n , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(on>t(h)r.eraudns()w,e )t;i d I| n ^B lock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpph:r6e:a1d:I dnote: xin instantiation of member function 'RunWork, 2, 2>::run' requested here. x), g6r | oIuMpP(Lg_rCoOuLpL)_,F U N| C ^~~~~~~~~~~( AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hthre:a562d:I15d:x .warning: xinitializer order does not match the declaration order [-Wreorder-ctor]) , group(group) ,562 | | ^~~~~~~~~~~~~~~~~ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d60):, note: nfield 'group' will be initialized after field 'stepSize't hread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d x .| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~( ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uin/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht8_t:)562 : 15| :^ warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | t391i | d ( tRiudn)W,o rnkt ,g rNoCuCpL(_gArLoGuOp_)#,# a l| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o , | N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_P R563O | T O _ # #sptreoptSoi>z(e)(.nrcucnl(S&hnmcecml.Schommemm..bwuofrfkS)i;z e\s [ N| C ^C L_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562I:M15P:L Enote: ]field 'nthreads' will be initialized after field 'tidInBlock'/ NCCL_ S562T | E P S / stiizde(otfi(dT)),) n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ( group(groupn threads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l641o:c11k:( tnote: hin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadId x641. | x ) , g r o u p ( gprroiumps)(,t i d| - ^~~~~~~~~~~~~~~~~t idSta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:t562R:e60d:u cnote: efield 'group' will be initialized after field 'stepSize', nThre a562d | s R e d utcied,( tdiidr)e,c tn-t>hdroewand,s (&ndtihrreecatd-s>)o,u tt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562 | : 562 : 15 :t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]( tid), nthreads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I d x| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x ), gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n c| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Shmem .563c | o m m . bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L| _ group(groupS TEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h): 677{: 11 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | group(group 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:r626i:m9s:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered -tidS t626a | r t B c a s t , pnrTihmrse(atdisdB-ctaisdtS,t a&rdtiSrceacttt-e>ro,u tn,T hdrieraedcstS-c>adtotwenr,, aNrUgLsL-,> sdeinrdebcutf-f>,u pa,r gasr-g>sr-e>csvebnudfbfu,f f ,| ^a rgs->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:c202v:b53u:f fnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :R53u:n Wnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer kEle m202e | n t < F n , T ,R uRneWdoOrpk,E lAelmgeon,t ,( )R.erduOnp(,w eA)l;g o ,| ^P roto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp):.6r:u1n:( wnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested here) ; | 6 ^ | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppO:L6L:_1F:U Nnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested here( AllR e6d | uIcMeP,L _CCOOLLLLN_EFTU_NDCI(RAElClTR,e dSuIcMeP,L EC,O LMLiNnE,T _iDnItR3E2C_Tt,) S I| M^P LE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :M391i:n95,: inote: nexpanded from macro 'IMPL_COLL_FUNC't 32_t) 391| | ^ RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:r391k:<95n:c cnote: lexpanded from macro 'IMPL_COLL_FUNC'F unc##f u391n | c , Rtuype, Funcn#W#odrekvf,u nNcC,C Lt_yApLeG,O _F#u#nacl#g#od,e vNrCeCdLo_pP#,# pNrCoCtLo_>A(L)G.Or_u#n#(a&lngcoc,l SNhCmCeLm_.PwRoOrTkO)_;# #\p r o| t ^o >()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15&:n cnote: cfield 'nthreads' will be initialized after field 'tidInBlock'l Shme m562. | w o r k )t;i d\( t i| d ^) , nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15(:n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d Inote: dfield 'group' will be initialized after field 'stepSize'x .x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~n thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~l ock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ect->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>15s:e nwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]b uff, arg s562- | > r e c vtbiudf(ft,i d )| , ^ nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202(:n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o T,u pR(egdrOopu,p )A,l g o| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ P r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t o>(). r563u | n ( w e )s;t e p| S ^i ze(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppS:h5m:e1m:. cnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested herem m.bu f5f | SIiMzPeLs_[CNOCLCLL__FPURNOCT(OA_lSlIRMePdLuEc]e/,N CCCOLL_LSNTEETP_SD/IsRiEzCeTo,f (STI)M)P L{E , | M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i n ,| group(groupu int8_t) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h^: 666:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 666 | 391 | RpurniWmosr(kt#udpe,v rNeUdLoLp,< tayrpges>-,> sNeCnCdLb_uAfLfG,O _a#r#gasl-g>or,e cNvCbCuLf_fP,R O T| O ^_ ##proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:(202):.53r:u nnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here& nccl S202h | m e m . w o r k )R;u n\W o r| k ^E lemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:<562F:n15,: Tnote: ,field 'nthreads' will be initialized after field 'tidInBlock' RedOp ,562 | A l g o ,t iPdr(ottiod>)(,) .nrtuhnr(ewaed)s;( n t| h ^r eads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppi:d6I:n1B:l onote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herek (thre a6d | IIdMxP.Lx_)C,O LgLr_oFuUpN(Cg(rAolulpR)e,d u c| e ^~~~~~~~~~~~~~~~~, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562N:E60T:_ DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bwarning: linitializer order does not match the declaration order [-Wreorder-ctor]o ck(threadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 15 group(group: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :562655 | : 11 : note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d(tid) ,655 | n t h r e a d s ( n tphrriemasd(st)i,d -ttiiddISntBalrotcRke(dtuhcree,a dnITdhxr.exa)d,s Rgerdouucpe(,g rnouulpl)p,t r ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~& d i| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ct-> o563u | t , a rsgtse-p>Ssiezned(bnucfcfl,S hamregms.-c>ormemc.vbbuuffffS,i z e| s ^[ NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_202S:I53M:P Lnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here] /NCCL _202S | T E P S / s i z eRoufn(WTo)r)k E{l e m| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t <| F group(groupn , T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641R:e11d:O pnote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Algo, P641r | o t o > ( ) . r u n (pwrei)m;s ( t| i ^d -tidStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:R7e:d1u:c enote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here nThr e7a | dIsMRPeLd_uCcOeL,L _dFiUrNeCc(tA-l>ldRoewdnu,c e&,d iCrOeLcLtN-E>To_uDtI,R EaCrTg,s -S>IsMePnLdEb,u fMfi,n ,a rugisn-t>3r2e_ctv)b u f| f^, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :expanded from macro 'IMPL_COLL_FUNC'202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 391 | R202u | n W o r k < n c cRluFnuWnocr#k#Efluenmce,n tto,> (N)C.CrLu_nA(LwGeO)_;# # a| l ^g o, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp_:P6R:O1T:O _note: #in instantiation of member function 'RunWork, 2, 2>::run' requested here# prot o6> | (I)M.PrLu_nC(O&LnLc_cFlUSNhCm(eAml.lwRoerdku)c;e ,\ C O| L ^L NET_DI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:E562C:T15,: Snote: Ifield 'nthreads' will be initialized after field 'tidInBlock'M PLE, M562i | n , i ntti3d2(_tti)d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391(:n95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a ds), t i391d | InBlock(threadIdx.x), group(grou p )R,u n W| o ^~~~~~~~~~~~~~~~~r kn,t hNrCeCaLd_sA)L,G Ot_i#d#IanlBgloo,c kN(CtChLr_ePaRdOITdOx_.#x#)p,r ogtroo>u(p)(.grruonu(p&)n,c c l| S ^~~~~~~~~~~h mem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tiodInBlrokc:, note: Nfield 'group' will be initialized after field 'stepSize'C CL_AL G562O | _ # # a ltgiod,( tNiCdC)L,_ PnRtOhTrOe_a#d#sp(rnotthor>e(a)d.sr)u,n (t&indcIcnlBSlhomcekm(.twhorreka)d;I d\x . x| ) ^, group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 15 ^~~~~~~~~~~: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:E562,: 15M:i nwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] int32_t )562 | | ^ tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:)391,: 95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads(nthr e391a | d s )R,u ntWiodrIkn , NC C563L | _ steApLSGiOz_e#(#naclcgloS,h mNeCmC.Lc_oPmRmO.TbOu_f#f#Spirzoetso[>N(C)C.Lr_uPnR(O&TnOc_cSlISMhPmLeEm]./wNoCrCkL)_;S T\E P S| / ^s izeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: note: | field 'nthreads' will be initialized after field 'tidInBlock' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d626(:t9i:d )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nthr e626a | d s ( n t h r e apdrsi)m,s (ttiiddI-ntBildoSctka(rtthSrceaatdtIedrx,. xn)T,h rgeraoduspS(cgartotuepr),, N U| L ^~~~~~~~~~~~~~~~~L , dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562t:-60>:u pnote: ,field 'group' will be initialized after field 'stepSize' args->s e562n | d b u f ft,i da(rtgisd-)>,r enctvhbruefafd,s ( n| t ^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Blo c202k | ( t h r e a d I dRxu.nxW)o,r kgErloeumpe(ngtr().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O15_:# #warning: ainitializer order does not match the declaration order [-Wreorder-ctor]l go, NCC L562_ | P R O T Ot_i#d#(ptriodt)o,> (n)t.hrruena(d&sn(cnctlhSrhemaedms.)w,o rtki)d;I n\B l o| c ^k (threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pnote: (field 'nthreads' will be initialized after field 'tidInBlock'g roup), | 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(t i563d | ) , n tshtreepaSdisz(en(tnhcrcelaSdhsm)e,m .tciodmImn.BbluofcfkS(itzherse[aNdCICdLx_.PxR)O,T Og_rSoIuMpP(LgEr]o/uNpC)C,L _ S| T ^~~~~~~~~~~~~~~~~E PS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:i562z:e60o:f (note: Tfield 'group' will be initialized after field 'stepSize') ) { | 562 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677n:t11h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s(nth r677e | a d s ) , t i d I npBrliomcsk((ttihdr-etaiddISdtxa.rxt)B,c agsrto,u pn(Tghrroeuapd)s,B c a| s ^~~~~~~~~~~t , &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562(:we15):; warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7: 1562: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here ti d7( | tIiMdP)L,_ CnOtLhLr_eFaUdNsC((nAtlhlrReeaddusc)e,, tCiOdLILnNBElTo_cDkI(RtEhCrTe,a dSIIdMxP.LxE),, Mgirno,u pu(ignrto3u2p_)t,) | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :563 | note: expanded from macro 'IMPL_COLL_FUNC' ste p391S | i z eR(unncWcolrSkh_,S TNECPCSL/_sAiLzGeOo_f#(#Ta)l)g o{, N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| P group(groupR OTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:t677o:>11(:) .note: rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(&nc c677l | S h m e m . w o r k )p;r i\m s (| t ^i d-tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:t562a:r15t:B cnote: afield 'nthreads' will be initialized after field 'tidInBlock's t, nT h562r | e a d s Btciads(tt,i d&)d,i rnetchtr-e>aodust(,n tdhirreeacdts-)>,d otwind,I naBrlgosc-k>(stehnrdebaudfIfd,x .axr)g,s -g>rroeucpv(bgurfofu,p ) ,| ^ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::6053:: note: note: field 'group' will be initialized after field 'stepSize'in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202562 | | t i d (RtuindW)o,r knEtlhermeeandts<(Fnnt,h rTe,a dRse)d,O pt,i dAIlngBol,o cPkr(otthor>e(a)d.Irduxn.(xw)e,) ;g r o| u ^p (group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp):,6 : 1| : ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | ^~~~~~~~~~~~~~~~~ :562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::15562:: 60warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd(ti:d562):,15 :n twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eads(nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562r:g15s:- >warning: rinitializer order does not match the declaration order [-Wreorder-ctor]e cvbuff, | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202(:t53i:d )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here nthr e202a | d s ( n t h r e aRdusn)W,o rtkiEdlIenmBelnotcp()),. r u| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( w e| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T); | ^ 563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:e9p:S1i:z enote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heren cclS h9m | eImM.PcLo_mCmO.LbLu_fFfUSNiCz(eAsl[lNRCeCdLu_cPeR,O TCOO_LSLINMEPTL_ED]I/RNECCCTL,_ SSTIEMPPSL/Es,i zMeionf,( Tu)i)n t{6 4 _| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) | | group(group^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::95666:: 9note: :expanded from macro 'IMPL_COLL_FUNC' note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | 666 | R u n W o r k < npcrcilmFsu(ntci#d#,f unnTch,r etaydpseG,a tFhuenrc,# #ddiervercetd-o>puL,L ,N CaCrLg_sA-L>GsOe_n#d#baulfgfo,, aNrCgCsL-_>PrReOcTvOb_u#f#fp,r o t| o ^> ().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:n202c:c53l:S hnote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree m.wor k202) | ; \ | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:W562o:r15k:E lnote: efield 'nthreads' will be initialized after field 'tidInBlock'm ent< F562n | , T , tRiedd(Otpi,d )A,l gnot,h rPeraodtso(>n(t)h.rreuads)n,( wtei)d;I n B| l ^o ck(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppe:a8d:I1d:x .note: xin instantiation of member function 'RunWork, 2, 2>::run' requested here) , gr o8u | pI(MgPrLo_uCpO)L,L _ F| U ^~~~~~~~~~~~~~~~~N C(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:l562l:R60e:d unote: cfield 'group' will be initialized after field 'stepSize'e , CO L562L | N E T _ DtIiRdE(CtTi,d )S,I MnPtLhEr,e aMdisn(,n tihnrte6a4d_st)), t| i^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k391(:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Idx.x), 391g | r o uRpu(ngWroorukp<)n,c c l| F ^~~~~~~~~~~u nc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ llptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), grou p562(grou | p ) , t| i ^~~~~~~~~~~~~~~~~d (tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n60t:h rnote: efield 'group' will be initialized after field 'stepSize'a ds(nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I dx.x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~c lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:a562r:t15R:e dwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]c e, nThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he: a562562d: | s15 R: e dwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]tc ied,( tn562iu | dl )l ,p t nrtt,i hd&r(detiairddes)c(,tn -tn>htorhuertae,da sda)sr,(g nstt-ih>drsIeenanBddlsbo)uc,fk f(t,ti hdraIernagBdslI-od>cxrk.e(xct)vh,br uegfarfdo,Iu dp x(| .g ^xr )o,u pg)r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,o: u202 p:| (53 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g: r onote: | uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), | 202 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 563 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) s 563t | eR pu Sn iW zosert(kenEpclSceilmzSeehn(mtneCO(L_)_S.PIrRMuOPnTL(OEw_]eS/)IN;MC PC LL| E_ ^]S /TNECPCSL//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp_s:Si9Tz:Ee1Po:Sf /(note: sTin instantiation of member function 'RunWork, 2, 2>::run' requested herei) z)e o {f9 ( | TI| )M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~)P L {_| C group(group O | L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L _ F| U group(groupN C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A687l:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl11:R:677e :dnote: 11uin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here:c enote: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here C O687L | L 677N | E T _ D I R E C Tp ,r ipSmrIsiM(mPtsLi(Edt,-i tdMi-idtnSi,td aSurtitanBrtct6aB4sc_tat,s) t n, T | hn^rT eharde/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hsa:Bd391csa:Bs95ct: note: expanded from macro 'IMPL_COLL_FUNC' a391 | Rsutn,W o&rdkiuonuct#,# fduinrce,c tt-y>pdeo,w nF,u nacr#g#sd-e>vsreenddobpug,s -N>CrCeLc_vAbLuGfOf_,# # a| l ^g o, NCCL_PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:p202:r53o:t onote: >in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( ).run (202& | n c c l S h m e mR.uwnoWrokr)k;E l\e m e| n ^t ().ru n562( | w e ) ; t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :n8t:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres (nt h8r | eIaMdPsL)_,C OtLiLd_IFnUBNlCo(cAkl(ltRherdeuacdeI,d xC.OxL)L,N EgTr_oDuIpR(EgCrTo,u pS)I,M P L| E ^~~~~~~~~~~~~~~~~, Min,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :i562n:t606:4 _note: tfield 'group' will be initialized after field 'stepSize') | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'( tid) ,391 | n t hRruenaWdosr(knr,o uNpC)C,L _ A| L ^~~~~~~~~~~G O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :warning: 8initializer order does not match the declaration order [-Wreorder-ctor]: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5628 | | I M P Lt_iCdO(LtLi_dF)U,N Cn(tAhlrleRaeddsu(cnet,h rCeOaLdLsN)E,T _tDiIdRIEnCBTl,o cSkI(MtPhLrEe,a dMIidnx,. xi)n,t 6g4r_otu)p ( g| r^o up), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 95 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: expanded from macro 'IMPL_COLL_FUNC' 563 | 391 | sRtuenpWSoirzke<(nnccccllFSuhnmce#m#.fcuonmcm,. btuyfpfeS,i zFeusn[cN#C#CdLe_vPrReOdToOp_E,] /NNCCCCLL__ASLTGEOP_S#/#sailzgeoo,f (NTC)C)L _{P R O| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O _ #| # group(groupp roto>().run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l687S:h11m:e mnote: .in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herew ork); \687 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562i:m15s:( tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd -tidS t562a | r t B c atsitd,( tniTdh)r,e andtshBrceaasdts,( n&tdhirreeacdts-)>,o utti,d InnuBlllopctkr(,t harregasd-I>dsxe.nxd)b,u fgfr,o uapr(ggsr-o>urpe)c,v b u| f ^~~~~~~~~~~~~~~~~f , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53 :562 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ti d202( | t i d ) , n t hRruenaWdso(rnktEhlreemaednst)<,F nt,i dTI,n BRleodcOkp(,t hArlegaod,I dPxr.oxt)o,> (g)r.oruupn((gwreo)u;p ) ,| ^ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562m:s15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]- tidSt a562r | t S c a tttiedr(,t indT)h,r enatdhsrSecaadtst(enrt,h rNeUaLdLs,) ,d itriedcItn-B>luopc,k (atrhgrse-a>dsIednxd.bxu)f,f ,g raorugps(-g>rroeucpv)b,u f f| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h563: | 202 : 53 : snote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree pSiz e202( | n c c l S h m e mR.ucnoWmomr.kbEulfefmSeinzteS(T)E.PrSu/ns(iwzee)o;f ( T| ) ^) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 8 :| 1 group(group: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :8626 | :I9M:P Lnote: _in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC OLL_FU N626C | ( A l l R e d u cper,i mCsO(LtLiNdE-Tt_iDdISRtEaCrTt,S cSaItMtPeLrE,, nMTihnr,e aidnstS6c4a_ttt)e r ,| ^N ULL,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :d391i:r95e:c tnote: -expanded from macro 'IMPL_COLL_FUNC'> up, ar g391s | - > sReunndWbourfkf<,n cacrlgFsu-n>cr#e#cfvubnucf,f ,t y p| e ^, Func/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:#202d:e53v:r enote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo p , N C C L _RAuLnGWOo_r#k#Eallegmoe,n tNo(,) .Prruont(o&>n(c)c.lrSuhnm(ewme.)w;o r k| ) ^; \ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :9:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWork, 2, 2>::run' requested here: 15: note: field 'nthreads' will be initialized after field 'tidInBlock'9 | IMPL _562C | O L L _ FtUiNdC((tAildl)R,e dnutcher,e aCdOsL(LnNtEhTr_eDaIdRsE)C,T ,t iSdIIMnPBLlEo,c kM(itnh,r euaidnItd6x4._xt)), g| r^o up(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95):, note: expanded from macro 'IMPL_COLL_FUNC'| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :39160 | : note: Rfield 'group' will be initialized after field 'stepSize'u nWork <562n | c c l F utnicd#(#tfiudn)c,, nttyhpree,a dFsu(nnct#h#rdeeavdrse)d,o ptl,o cNkC(CtLh_rAeLaGdOI_d#x#.axl)g,o ,g rNoCuCpL(_gPrRoOuTpO)_,# # p| r ^~~~~~~~~~~o to>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hre:oa562ud:ps15((:gn rtwarning: ohinitializer order does not match the declaration order [-Wreorder-ctor]ur pe)a,d s )562| , | ^~~~~~~~~~~ t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c k (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h rea d563I | d x . x )s,t egprSoiuzpe((gnrcoculpS)h,m e m| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c o m| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). buff S563i | z e s [ NsCtCeLp_SPiRzOeT(On_cScIlMSPhLmEe]m/.NcCoCmLm_.SbTuEfPfSS/isziezse[oNfC(CTL)_)P R{O T O| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S I M| P group(groupL E]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:S655T:E11P:S /note: sin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei zeof (655T | ) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | p group(groupr ims(tid-t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d666S:t9a:r tnote: Rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree duce, 666n | T h r e a d s R epdruicmes,( tniudl,l pntTrh,r e&addisrGeactth-e>ro,u td,i raercgts-->>uspe,n dNbUuLfLf,, aarrggss-->>sreencdvbbuuffff,, a r| g ^s ->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hv:b202u:f53f:, note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53R:u nnote: Win instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo rkEle m202e | n t < F /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), n, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ent().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562c:o15m:m .warning: binitializer order does not match the declaration order [-Wreorder-ctor]u ffSizes[ N562C | C L _ P RtOiTdO(_tSiIdM)P,L En]t/hNrCeCaLd_sS(TnEtPhSr/esaidzse)o,f (tTi)d)I n{B l o| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ( t| h group(groupr eadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r655o:u11p:( gnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up), 655| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) p563r | i m s ( tsitde-ptSiidzSet(anrctcRleSdhumceem,. cnoTmhmr.ebaudfsfRSeidzuecse[,N CnCuLl_lPpRtOrT,O _&SdIiMrPeLcEt]-/>NoCuCtL,_ SaTrEgPsS-/>ssieznedobfu(fTf),) a{r g s| - ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~> r e| c group(groupv buff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^626 :9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :626 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s ( t i dR-utniWdoSrtkaErlteSmceanttt (d)i.rreucnt(-w>eu)p;, a| r ^g s->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppb:u9f:f1,: anote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereg s->r e9c | vIbMuPfLf_,C O L| L ^_ FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:R202e:d53u:c enote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here COLL N202E | T _ D I R E C T ,R uSnIWMoPrLkEE,l eMmienn,t ().run (391w | e ) ;R u n| W ^o rk, 2, 2>::run' requested heren c, t y8p | eI,M PFLu_nCcO#L#Ld_eFvUrNeCd(oAplc,e ,N CCCOLL_LANLEGTO__D#I#RaElCgTo,, SNICMCPLL_EP,R OMTiOn_,# #ipnrto6t4o_>t()) . r| u^n (&ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:S391h:m95e:m .note: wexpanded from macro 'IMPL_COLL_FUNC'o rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | 666 | R u n W o r k oupp<,t yNpUeL>L,, NaCrCgLs_-A>LsGeOn_d#b#uaflfg,o ,a rNgCsC-L>_rPeRcOvTbOu_f#f#,p r o| t ^o >().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:&202n:c53c:l Snote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem em.wo r202k | ) ; \ | ^ RunWorkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:<562F:n15,: Tnote: ,field 'nthreads' will be initialized after field 'tidInBlock' RedOp, A562l | g o , Ptriodt(ot>i(d)).,r unnt(hwree)a;d s (| n ^t hreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp,: 8t:i1d:I nnote: Bin instantiation of member function 'RunWork, 2, 2>::run' requested herel ock( t8h | rIeMaPdLI_dCxO.LxL)_,F UgNrCo(uApl(lgRreoduupc)e,, C| O ^~~~~~~~~~~~~~~~~L LNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:D562I:R60E:C Tnote: ,field 'group' will be initialized after field 'stepSize' SIMPL E562, | M i n ,t idi(ntti6d4)_,t )n t h| r^e ads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: )expanded from macro 'IMPL_COLL_FUNC', tidIn B391l | o c kR(utnhWroerakd, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: 562note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ti d666( | t i d ) , n t hprreiamdss((tnitdh,r enaTdhsr)e,a dtsiGdaItnhBelro,c kd(itrherceta-d>Iudpx,. xN)U,L Lg,r oaurpg(sg-r>osuepn)d,b u f| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, a| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g s->r e563c | v b u f fs,t e p| S ^i ze(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m202e:m53.:c onote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem .buff S202i | z e s [ N C C L _RPuRnOWToOr_kSEIlMePmLeEn]t/ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( ) .| r group(groupu n(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 655 ^: 11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1 :655 | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | I MpPrLi_mCsO(LtLi_dF-UtNiCd(SAtlalrRteRdeudcuec,e ,C OnLTLhNrEeTa_dDsIRReEdCuTc,e ,S InMuPlLlEp,t rM,i n&,d iurienctt6-4>_otu)t , | a^r gs->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:e391n:d95b:u ffnote: ,expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha 391r | g s -:R>562ur:ne15Wc:ov rbwarning: kuinitializer order does not match the declaration order [-Wreorder-ctor], 2, 2>::run' requested heret, i dF )u202,n | c n# t# hd re ev ar de sdR(ounpntls,e) m,Ne CntCtiLp#((p)gr.rorotuuonp>()(w,)e .) r;| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n (| & ^| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c clS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpph: m7563e: | m1 .: w onote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heresk t)e; p 7S\ | i Iz Me| P( ^Ln _cCcOl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hLS:Lh562_m:Fe15Um:N. Ccnote: (ofield 'nthreads' will be initialized after field 'tidInBlock'Am lml.R be562ud | fu fc Se i, z teCisOd[L(NLtCNiCEdLT)__,PD RInORtTEhOCr_TeS,aI dMSsPI(LMnEPt]Lh/ErN,eC aCMdLis_n)S,,T EutPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677 :| 11 ^~~~~~~~~~~~~~~~~: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60677: | note: field 'group' will be initialized after field 'stepSize' 562p | r i m s (ttiidd(-ttiidd)S,t anrtthBrceaasdts,( nntThhrreeaaddss)B,c atsitd,I n&Bdliorcekc(tt-h>roeuatd,I ddxi.rxe)c,t -g>rdoouwpn(,g raorugps)-,> s e| n ^~~~~~~~~~~d buff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 687 : 11 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), 687n | t h r e a d s ( n t hprreiamdss()t,i dt-itdiIdnSBtlaorctkB(ctahsrte,a dnITdhxr.exa)d,s Bgcraosutp,( g&rdoiurpe)c,t - >| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u t ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n ullpt r563, | a r g ss-t>espeSnidzbeu(fnfc,c laSrhgmse-m>.rceocmvmb.ubfuff,f S i| z ^e s[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS IMPL E202] | / N C C L _ S T ERPuSn/WsoirzkeEolfe(mTe)n)t <{F n ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T , | R group(groupe dOp, Algo, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o666t:o9>:( )note: .in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer un(we) ;666 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppp:r8i:m1s:( tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered , n T8h | rIeMaPdLs_GCaOtLhLe_rF,U NdCi(rAelcltR-e>duupc,e ,N UCLOLL,L NaErTg_sD-I>RsEeCnTd,b uSfIfM,P LaEr,g sM-i>nr,e civnbtu6f4f_,t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391202::9553:: note: note: expanded from macro 'IMPL_COLL_FUNC'in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 391 | R u n W oRruknt(y)p.er>u,n (NwCeC)L;_ A L| G ^O _##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppo:,7 :N1C:C Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereP ROTO _7# | #IpMrPoLt_oC>O(L)L._rFuUnN(C&(nAclcllRSehdmuecme.,w oCrOkL)L;N E\T _ D| I ^R ECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :S562I:M15P:L Enote: ,field 'nthreads' will be initialized after field 'tidInBlock' Min, ui n562t | 3 2 _ t )t i d| (^t id), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'( nthrea d391s | ) , RtuindWIonrBkl:,60 :N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _ALGO _562# | # a l g ot,i dN(CtCiLd_)P,R OnTtOh_r#e#apdrso(tnot>h(r)e.ardusn)(,& ntcicdlISnhBmleomc.kw(otrhkr)e;a d\I d x| . ^x ), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r onote: ufield 'nthreads' will be initialized after field 'tidInBlock'p ), | ^~~~~~~~~~~562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C15L:_ Awarning: Linitializer order does not match the declaration order [-Wreorder-ctor]G O_##algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrecacdlsS)h,m etmi.dwIonrBkl)o;c k\( t h| r ^e adIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), nt h563r | e a d s (snttehprSeiazdes()n,c ctliSdhImneBml.occokm(mt.hbruefafdSIidzxe.sx[)N,C CgLr_oPuRpO(TgOr_oSuIpM)P,L E ]| / ^~~~~~~~~~~~~~~~~N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:T562E:P60S:/ snote: ifield 'group' will be initialized after field 'stepSize'z eof(T )562) | { | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(groupi d), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a626d:s9(:n tnote: hin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads) ,626 | t i d I n B l o cpkr(itmhsr(etaiddI-dtxi.dxS)t,a rgtrSocuapt(tgerro,u pn)T,h r e| a ^~~~~~~~~~~d sScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h NCC:L562_:A15L:G Owarning: _initializer order does not match the declaration order [-Wreorder-ctor]# #algo, NCCL _562P | R O T O _t#i#dp(rtoitdo)>,( )n.trhurne(a&dnsc(cnltShhrmeeamd.sw)o,r kt)i;d I\n B l| o ^c k(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), 563n | t h r e asdtse(pnStihzree(andcsc)l,S htmiedmI.ncBolmomc.kb(utfhfrSeiazdeIsd[xN.CxC)L,_ PgRrOoTuOp_(SgIrMoPuLpE)],/ N C| C ^~~~~~~~~~~~~~~~~L _ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:P562S:/60s:i znote: efield 'group' will be initialized after field 'stepSize'o f(T) )562 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ( group(groupt id), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n687t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres ), ti d687I | n B l o c k ( t h r epardiImdsx(.txi)d,- tgirdoSutpa(rgtrBocuaps)t,, n| T ^~~~~~~~~~~h readsBcast, &direct->out, nullptr, args->sendbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:f562,: 15a:r gwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]- >recvbu f562f | , | ^t id(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s(nt h202r | e a d s ) , t iRduInnWBolrokcEkl(etmherneta ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( ) .| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u n(we) ;563 | | ^ stepSize(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppc:l8S:h1m:e mnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herec omm. b8u | fIfMSPiLz_eCsO[LNLC_CFLU_NPCR(OATlOl_RSeIdMuPcLeE,] /CNOCLCLLN_ESTT_EDPISR/EsCiTz,e oSfI(MTP)L)E ,{ M i| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, i| n group(groupt 64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h11::391 :note: 95in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: expanded from macro 'IMPL_COLL_FUNC' 677 | 391 | R u n W oprrkic,t -N>CoCuLt_,A LdGiOr_e#c#ta-l>gdoo,w nN,C CaLr_gPsR-O>TsOe_n#d#bpurfoft,o >a(r)g.sr-u>nr(e&cnvcbculfSfh,m e m| . ^w ork); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15202: | note: field 'nthreads' will be initialized after field 'tidInBlock' R562u | n W o r ktEilde(mteindt)<,F nn,t hTr,e aRdesd(Onpt,h rAelagdos,) ,P rtoitdoI>n(B)l.orcukn((twher)e;a d I| d ^x .x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppo:u10p:(1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ), | 10 ^~~~~~~~~~~~~~~~~ | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562C:O60L:L _note: Ffield 'group' will be initialized after field 'stepSize'U NC(Al l562R | e d u c et,i dC(OtLiLdN)E,T _nDtIhRrEeCaTd,s (SnItMhPrLeEa,d sM)i,n ,t ihdaIlnfB)l o c| k^( thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391I:d95x:. xnote: )expanded from macro 'IMPL_COLL_FUNC', grou p391( | g r oRuupn)W,o r k| < ^~~~~~~~~~~n cclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork#(#)d.ervurne(d&onpce,m .NwCoCrLk_)A;L G\O _ #| # ^a lgo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:R562O:T15O:_ #note: #field 'nthreads' will be initialized after field 'tidInBlock'p roto>( )562. | r u n ( &tnicdc(ltSihdm)e,m .nwtohrrke)a;d s\( n t| h ^r eads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bnote: lfield 'nthreads' will be initialized after field 'tidInBlock'o ck(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Bloc k562( | t h r e atdiIdd(xt.ixd)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I60n:B lnote: ofield 'group' will be initialized after field 'stepSize'c k(thre a562d | I d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s (| n ^~~~~~~~~~~t hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 :60: note: field 'group' will be initialized after field 'stepSize'563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 202warning: | initializer order does not match the declaration order [-Wreorder-ctor] 562R | u n W o rtkiEdl(etmiedn)t,< Fnnt,h rTe,a dRse(dnOtph,r eAaldgso),, PtriodtIon>B(l)o.crku(nt(hwree)a;d I d| x ^. x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppu:p7(:g1r:o unote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~7 | I| M tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)P L_COL L563_ | F U N C (sAtlelpRSeidzuec(en,c cClOSLhLmNeEmT._cDoImRmE.CbTu,f fSSIiMzPeLsE[,N CMCiLn_,P RuOiTnOt_3S2I_MtP)L E ]| /^N CCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:E391P:S95/:s inote: zexpanded from macro 'IMPL_COLL_FUNC'e of(T)) {391 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R u n| W group(groupo rk, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec , typ e687, | F u n c # # d e v rperdiomps<(ttyipde->t,i dNSCtCaLr_tABLcGaOs_t#,# anlTghor,e aNdCsCBLc_aPsRtO,T O&_d#i#rpercott-o>>o(u)t.,r unnu(l&lnpctcrl,S hamregms.-w>osrekn)d;b u\f f ,| ^a rgs->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fnote: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202t:i53d:( tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered ), n202t | h r e a d s ( n tRhurneWaodrsk)E,l etmiednItng(r)o.urpu)n,( w e| ) ^~~~~~~~~~~~~~~~~; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppnote: :field 'group' will be initialized after field 'stepSize'10 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 10 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eAaldlsR(endtuhcree,a dCsO)L,L NtEiTd_IDnIBRlEoCcTk,( tShIrMePaLdEI,d xM.ixn),, hgarlofu)p ( g| r^o up),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^~~~~~~~~~~95 : note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s tsetpeSpiSziez(en(cncclcSlhSmhemme.mc.ocmomm.mb.ubfuffSfiSziezse[sN[CNCCLC_LP_RPORTOOT_OS_ISMIPMLPEL]E/]N/CNCCLC_LS_TSETPESP/Ss/isziezoefo(fT()T)) ){ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::655641::1111:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655641 | | pprriimmss((ttiidd--ttiiddSSttaarrttRReedduuccee,, nnTThhrreeaaddssRReedduuccee,, nduilrlepcttr-,> d&odwinr,e c&td-i>roeuctt,- >aorugts,- >asregnsd-b>usfefn,d baurfgfs,- >arregcsv-b>urfefc,v b u| f ^f , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R u n W oRruknEWloermkeEnlter(o)t.or>u(n)(.wreu)n;( w e| ) ^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 8note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IM P8L | _ICMOPLLL__CFOULNLC_(FAUlNlCR(eAdlulcRee,d uCcOeL,L NCEOTL_LDNIERTE_CDTI,R ESCITM,P LSEI,M PMLiEn,, Miinnt,6 4i_ntt)6 4 _| t^) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::391 :note: 95expanded from macro 'IMPL_COLL_FUNC': note: expanded from macro 'IMPL_COLL_FUNC' 391 | 391R | u n WRournkWy,p eN>C,C LN_CACLLG_OA_L#G#Oa_l#g#oa,l gNoC,C LN_CPCRLO_TPOR_O#T#Op_r#o#tpor>o(t)o.>r(u)n.(r&unnc(c&lnSchcmleSmh.mweomr.kw)o;r k\) ; | \ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15::562 :note: 15field 'nthreads' will be initialized after field 'tidInBlock': note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::60562:: 60note: :field 'group' will be initialized after field 'stepSize' note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :s twarning: einitializer order does not match the declaration order [-Wreorder-ctor]p Size(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h563: | 677 : 11 : snote: tin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree pSize( n677c | c l S h m e m . c o mpmr.ibmusf(ftSiidz-etsi[dNSCtCaLr_tPBRcOaTsOt_,S InMTPhLrEe]a/dNsCBCcLa_sStT,E P&Sd/isriezceto-f>(oTu)t), {d i r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c t -| > group(groupd own, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:g626s:-9>:s enote: nin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered buff, 626a | r g s - > r e c vpbruifmfs,( t i| d ^- tidStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:S202c:a53t:t enote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nTh r202e | a d s S c a t t eRru,n WNoUrLkLE,l edmiernetc,u pT,, aRregdsO-p>,s eAnldgbou,f fP,r oatrog>s(-)>.rreucnv(bwuef)f;, | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp::20210::531:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 20210 | | I M P L _ C O LRLu_nFWUoNrCk(EAllelmReendtu,( )M.irnu,n (hwael)f;) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp::39110::951:: note: note: expanded from macro 'IMPL_COLL_FUNC'in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 391I | M P LR_uCnOWLoLr_kF,, hNaClCfL)_ A L| G^O _##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:l391g:o95,: Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'C L_PROT O391_ | # #proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' ROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:l562Sh:m15e:m .warning: winitializer order does not match the declaration order [-Wreorder-ctor]o rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), 563| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e60p:S inote: zfield 'group' will be initialized after field 'stepSize'e (ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:,562 :M15i:n ,warning: initializer order does not match the declaration order [-Wreorder-ctor]i nt64_t) | 562^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthread s391( | n t hRruenaWdosr)k,< ntcicdlIFnuBnlco#c#kf(utnhcr,e atdyIpdex,. xF)u,n cg#r#oduepv(rgerdooupp<)t,y p e| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_ALGO _563# | # a l g os,t eNpCSCiLz_eP(RnOcTcOl_S#h#mpermo.tcoo>m(m)..bruufnf(S&inzcecsl[SNhCmCeLm_.PwRoOrTkO)_;S I\M P L| E ^] /NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:T562E:P15S:/ snote: ifield 'nthreads' will be initialized after field 'tidInBlock'z eof(T )562) | { | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(groupi d), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a666d:s9(:n tnote: hin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads) ,666 | t i d I n B l o cpkr(itmhsr(etaiddI,d xn.Txh)r,e agdrsoGuapt(hgerro,u pd)i,r e c| t ^~~~~~~~~~~~~~~~~- >u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:,562 :N60U:L Lnote: ,field 'group' will be initialized after field 'stepSize' args- >562s | e n d b utfifd,( tairdg)s,- >nrtehcrvebaudfsf(,n t h| r ^e ads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53I:n Bnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo ck(t h202r | e a d I d x . x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~T , RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWor k562E | l e m e ntti,( )t.irduInn(Bwleo)c;k ( t| h ^r eadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppx:)11,: 1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep (gro u11p | )I,M P L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C O L| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ FUNC (563A | l l R e dsutceep,S iCzOeL(LnNcEcTl_SDhImReEmC.Tc,o mSmI.MbPuLfEf,S iMziens,[ NfClCoLa_tP)R O T| O^_ SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:]391/:N95C:C Lnote: _expanded from macro 'IMPL_COLL_FUNC'S TEPS/s i391z | e o fR(uTn)W)o r{k < n| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c l F| u group(groupn c##func/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641t:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(nthre:a562d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock(threadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(n RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562E:l15e:m ewarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t t(h)r.eraudns((wnet)h;r e a| d ^s ), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppn:B10l:o1c:k (note: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh readI d10x | .IxM)P,L _gCrOoLuLp_(FgUrNoCu(pA)l,l R e| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u c e| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) COLLN E563T | _ D I R EsCtTe,p SSiIzMeP(LnEc,c lMSihnm,e mh.aclofm)m . b| u^f fSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:[391N:C95C:L _note: Pexpanded from macro 'IMPL_COLL_FUNC'R OTO_SI M391P | L E ]R/uNnCWCoLr_kSnote: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here NCCL_A L626G | O _ # # a l g o ,p rNiCmCsL(_tPiRdO-TtOi_d#S#tparrottSoc>a(t)t.erru,n (n&TnhcrcelaSdhsmSecma.twtoerrk,) ;N U\L L ,| ^d irect/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562u:p15,: anote: rfield 'nthreads' will be initialized after field 'tidInBlock'g s->s e562n | d b u f ft,i da(rtgisd-)>,r enctvhbruefafd,s ( n| t ^h reads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202t:i53d:I nnote: Bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel ock(t h202r | e a d I d x . x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~~~~~~~T , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:e562d:O60p:, note: Afield 'group' will be initialized after field 'stepSize'l go, P562r | o t o > (t)i.dr(utni(dw)e,) ;n t h| r ^e ads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppr:e11a:d1s:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested heret idIn B11l | oIcMkP(Lt_hCrOeLaLd_IFdUxN.Cx()A,l lgRreoduupc(eg,r oCuOpL)L,N E T| _ ^~~~~~~~~~~D IRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartSc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ atter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562_:F15U:N Cwarning: (initializer order does not match the declaration order [-Wreorder-ctor]A llReduc e562, | C O L LtNiEdT(_tDiIdR)E,C Tn,t hSrIeMaPdLsE(,n tMhirne,a dfsl)o,a tt)i d I| n^B lock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t391h:r95e:a dnote: Iexpanded from macro 'IMPL_COLL_FUNC'd x.x), 391g | r o uRpu(ngWroorukp<)n,c c l| F ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n c| # tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# func, 563t | y p e , sFtuenpcS#i#zdee(vnrcecdloSpho,m mN.CbCuLf_fASLiGzOe_s#[#NaClCgLo_,P RNOCTCOL__SPIRMOPTLOE_]#/#NpCrCoLt_oS>T(E)P.Sr/usni(z&enocfc(lTS)h)m e{m . w| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r k )| ; group(group \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h655::56211::15 :note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: field 'nthreads' will be initialized after field 'tidInBlock' 655 | 562 | t i d ( tpirdi)m,s (nttihdr-etaiddsS(tnatrhtrReeaddusc)e,, tniTdhIrneBaldoscRke(dtuhcre, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.commeadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdS:t562a:r15t:S cwarning: ainitializer order does not match the declaration order [-Wreorder-ctor]t ter, nTh r562e | a d s S ctaitdt(etri,d )N,U LnLt,h rdeiardesc(tn-t>hurpe,a dasr)g,s -t>isdeInndBbluofcfk,( tahrrgesa-d>Irdexc.vxb)u,f fg,r o u| p ^( group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | s tReupnSWiozrek(EnlcecmleSnhtmT(O)_.SrIuMnP(LwEe])/;N C C| L ^_ STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp/:s12i:z1e:o fnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested hereT )) { 12 | | I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| _ group(groupC OLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A666l:l9R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, CO L666L | N E T _ D I R E CpTr,i mSsI(MtPiLdE,, nMTihnr,e addosuGbalteh)e r ,| ^d irec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:-391>:u95p:, note: Nexpanded from macro 'IMPL_COLL_FUNC'U LL, ar g391s | - > sReunndWbourfkf<,n cacrlgFsu-n>cr#e#cfvubnucf,f ,t y p| e ^, Func##d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:v202r:e53d:o pnote: , 2, 2>::run' requested heret ype>, 202N | C C L _ A L G O _R#u#naWlogrok,E lNeCmCeLn_tP,( )A.lrguon,( &PnrcoctloS>h(m)e.mr.uwno(rwke));; \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp562::1115::1 :note: field 'nthreads' will be initialized after field 'tidInBlock'note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 11 | I M PtLi_dC(OtLiLd_)F,U NnCt(hArlelaRdesd(uncteh,r eCaOdLsL)N,E Tt_iDdIIRnEBClTo,c kS(ItMhPrLeEa,d IMdixn.,x )f,l ogarto)u p (| g^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^~~~~~~~~~~~~~~~~: 95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562expanded from macro 'IMPL_COLL_FUNC': 60: note: field 'group' will be initialized after field 'stepSize' 391 | 562 | R u n W otrikd<(ntcicdl)F,u nnct#h#rfeuandcs,( nttyhpree,a dFsu)n,c #t#iddeIvnrBeldoocpk<(ttyhpree>a,d INdCxC.Lx_)A,L GgOr_o#u#pa(lggroo,u pN)C,C L _| P ^~~~~~~~~~~R OTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:d562o:w15n:, warning: &initializer order does not match the declaration order [-Wreorder-ctor]d irect->o u562t | , a r gtsi-d>(steindd)b,u fnft,h raeragdss-(>nrtehcrvebaudfsf),, t| i ^d InBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d Inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested herex .x), g202r | o u p ( g r o u pR)u,n W o| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k E l| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m ent.(c)o.mrmu.nb(uwfef)S;i z e| s ^[ NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppR:O10T:O1_:S Inote: Min instantiation of member function 'RunWork, 2, 2>::run' requested hereP LE]/ N10C | CILM_PSLT_ECPOSL/Ls_iFzUeNoCf((ATl)l)R e{d u c| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, C| O group(groupL LNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:C666T:,9 :S Inote: Min instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP LE, M i666n | , h a l f ) p| r^i ms(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391,: 95n:T hnote: rexpanded from macro 'IMPL_COLL_FUNC'e adsGath e391r | , dRiurneWcotr-k> steynpdeb,u fFfu,n ca#r#gdse-v>rreedcovpb ,| ^N CCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:G202O:_53#:# anote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg o, N C202C | L _ P R O T O _ #R#upnrWootrok>E(l)e.mreunnt(<&Fnnc,c lTS,h mReemd.Owpo,r kA)l;g o\, P| r ^o to>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15w:e )note: ;field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp : 9 : 1t:i dnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heret id), 9n | tIhMrPeLa_dCsO(LnLt_hFrUeNaCd(sA)l,l RteidduIcneB,lo cCkO(LtLhNrEeTa_dDIIdRxE.CxT),, SgIrMoPuLpE(,g rMoiunp,) ,u i n| t ^~~~~~~~~~~~~~~~~6 4_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562 :| 60^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :562 | note: expanded from macro 'IMPL_COLL_FUNC' tid( t391i | d ) ,R unntWhorrekau,p (NgCrCoLu_pA)L,G O _| # ^~~~~~~~~~~# algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, NCC:L562_:P15R:O Twarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ ##proto>().ru n562( | & n c c ltSihdm(etmi.dw)o,r kn)t;h r\e a d| s ^( nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~~~~~~~z es[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60P:R Onote: Tfield 'group' will be initialized after field 'stepSize'O _SIM P562L | E ] / N CtCiLd_(StTiEdP)S,/ snitzheroefa(dTs)()n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s )| , group(group tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a641d:I11d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , grou p641( | g r o u p ) , | ^~~~~~~~~~~p rims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] prims(ti d562- | t i d S ttairdt(Rteiddu)c,e ,n tnhTrheraedasd(snRtehdruecaed,s )d,i rteicdtI-n>Bdloowcnk,( t&hdrieraedcItd-x>.oxu)t,, garrogusp-(>gsreonudpb)u,f f ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a r g| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)- >recvb u563f | f , | s ^t epSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:S202h:m53e:m .note: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo mm.b u202f | f S i z e s [ N CRCuLn_WPoRrOkTEOl_eSmIeMnPtL ( )| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r u n| ( group(groupw e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppnote: :in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here11 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here677 | 11 | I M P L _pCrOiLmLs_(FtUiNdC-(tAildlSRteadrutcBec,a sCtO,L LnNTEhTr_eDaIdRsEBCcTa,s tS,I M&PdLiEr,e cMti-n>,o uftl,o adti)r e c| t^- >down,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a391r:g95s:- >note: sexpanded from macro 'IMPL_COLL_FUNC'e ndbuff ,391 | a r gRsu-n>Wroerckv, 2, 2>::run' requested heren c##de v202r | e d o p < t y p eR>u,n WNoCrCkLE_lAeLmGeOn_t#<#Fanl,g oT,, NRCeCdLO_pP,R OATlOg_o#,# pPrroottoo>>(())..rruunn((&wnec)c;l S h| m ^e m.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp):;10 :\1 : | note: ^in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h10: | 562I:M15P:L _note: Cfield 'nthreads' will be initialized after field 'tidInBlock'O LL_FU N562C | ( A l l Rteiddu(ctei,d )C,O LnLtNhErTe_aDdIsR(EnCtTh,r eSaIdMsP)L,E ,t iMdiInn,B lhoaclkf()t h r| e^a dIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:x391):,95 :g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p(gro u391p | ) , R u| n ^~~~~~~~~~~~~~~~~W orkn,t hNrCeCaLd_sA)L,G Ot_i#d#IanlBgloo,c kN(CtChLr_ePaRdOITdOx_.#x#)p,r ogtroo>u(p)(.grruonu(p&)n,c c l| S ^~~~~~~~~~~h mem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor]202 | R562u | n W o r ktEilde(mteindt)<,F nn,t hTr,e aRdesd(Onpt,h rAelagdos,) ,P rtoitdoI>n(B)l.orcukn((twher)e;a d I| d ^x .x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppo:u13p:(1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ), | 13 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | I M| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _COLL _563F | U N C ( AsltleRpeSdiuzcee(,n cCcOlLSLhNmEeTm_.DcIoRmEmC.Tb,u fSfISMiPzLeEs,[ NMCiCnL,_ PrRcOcTlO__bSfIlMoPaLtE1]6/)N C C| L^_ STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:/391s:i95z:e onote: fexpanded from macro 'IMPL_COLL_FUNC'( T)) { 391| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn Work, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec , type ,677 | F u n c # #devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkout, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'563 | s t562e | p S i z et(indc(ctliSdh)m,e mn.tchormema.dbsu(fnftShirzeeasd[sN)C,C Lt_iPdRIOnTBOl_oScIkM(PtLhEr]e/aNdCICdLx_.SxT)E,P Sg/rsoiuzpe(ogfr(oTu)p)) ,{ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562: | 655 : 11 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid), n655t | h r e a d s ( n t h rperaidmss)(,t itdi-dtIindBSltoacrkt(Rtehdruecaed,I dnxT.hxr)e,a dgsrRoeudpu(cger,o unpu)l,l p t| r ^~~~~~~~~~~, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:E15l:e mwarning: einitializer order does not match the declaration order [-Wreorder-ctor]n tr(e)a.drsu(nn(twher)e;a d s| ) ^, tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppl:o12c:k1(:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree adId x12. | xI)M,P Lg_rCoOuLpL(_gFrUoNuCp()A,l l R| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d u c| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), COL L563N | E T _ D IsRtEeCpTS,i zSeI(MnPcLcEl,S hMmienm,. cdoomumb.lbeu)f f S| i^z es[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C391L:_95P:R Onote: Texpanded from macro 'IMPL_COLL_FUNC'O _SIMPL E391] | / N CRCuLn_WSoTrEkP, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep e>, N C687C | L _ A L G O _ # # a lpgroi,m sN(CtCiLd_-PtRiOdTSO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t15e:p Swarning: iinitializer order does not match the declaration order [-Wreorder-ctor]z e(ncclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 677s | t e p S i z e ( n c cplrSihmmse(mt.icdo-mtmi.dbSutfafrStiBzceass[tN,C CnLT_hPrReOaTdOs_BScIaMsPtL,E ]&/dNiCrCeLc_tS-T>EoPuSt/,s idzieroefc(tT-)>)d o{w n ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a r g| s group(group- >sendbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641a:r11g:s -note: >in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer ecvbuf f641, | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :p202r:i53m:s (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d-ti d202S | t a r t R e d u cReu,n WnoTrhkrEelaedmseRnetd,d oAwlng,o ,& dPirroetcot>-(>)o.urtu,n (awreg)s;- > s| e ^n dbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :a12r:g1s:- >note: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree cvbuf f12, | I M| P ^L _COLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hU:N202C:(53A:l lnote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree duce ,202 | C O L L N E T _ DRIuRnEWCoTr,k ESlIeMmPeLnEt,< FMni,n ,T ,d oRuebdlOep), A| l^g o, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391t:o95>:( )note: .expanded from macro 'IMPL_COLL_FUNC'r un(we) ;391 | | ^R unWork, 2, 2>::run' requested here# func ,13 | tIyMpPeL,_ CFOuLnLc_#F#UdNeCv(rAeldloRpe ,C ONLCLCNLE_TA_LDGIOR_E#C#Ta,l gSoI,M PNLCEC,L _MPiRnO,T Or_c#c#lp_rboftloo>a(t)1.6r)u n (| &^n cclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m391e:m95.:w onote: rexpanded from macro 'IMPL_COLL_FUNC'k ); \ | 391 ^ | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:W562o:r15k:< nnote: cfield 'nthreads' will be initialized after field 'tidInBlock'c lFunc #562# | f u n c ,t itdy(ptei,d )F,u nnct#h#rdeeavdrse(dnotph),, NtCiCdLI_nABLlGoOc_k#(#tahlrgeoa,d INdCxC.Lx_)P,R OgTrOo_u#p#(pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~n (&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S60h:m enote: mfield 'group' will be initialized after field 'stepSize'. work) ;562 | \ | ^t id(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~d Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkEleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N C562C | L _ A L GtOi_d#(#taildg)o,, nNtChCrLe_aPdRsO(TnOt_h#r#epardost)o,> (t)i.drIunnB(l&oncckc(ltShhrmeeamd.Iwdoxr.kx));, \g r o| u ^p (gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[Ns(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkup, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work#)#;a l\g o ,| ^N CCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:O562T:O15_:# #note: pfield 'nthreads' will be initialized after field 'tidInBlock'r oto>( )562. | r u n ( &tnicdc(ltSihdm)e,m .nwtohrrke)a;d s\( n t| h ^r eads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a dIdx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~t hr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d Inote: dfield 'group' will be initialized after field 'stepSize'x .x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~n threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThirdeIandBslBoccaks(tt,h r&edaidrIedcxt.-x>)o,u tg,r onuupl(lgprtoru,p )a,r g s| - ^~~~~~~~~~~> sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_#In file included from #pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cppo:t1o: >In file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):.10r: uIn file included from n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h(:&167n: c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:l562S:h15m:e mwarning: .initializer order does not match the declaration order [-Wreorder-ctor]w ork); \ | ^562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x )563, | g r o uspt(egprSoiuzpe)(,n c c| l ^~~~~~~~~~~~~~~~~S hme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562c:o60m:m .note: bfield 'group' will be initialized after field 'stepSize'u ffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:)916,: 7 :| ^~~~~~~~~~~note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: warning: field 'nthreads' will be initialized after field 'tidInBlock'initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)60 : note: field 'group' will be initialized after field 'stepSize' 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562563: | 60 : note: field 'group' will be initialized after field 'stepSize's tepSi z562e | ( n c c ltSihdm(etmi.dc)o,m mn.tbhurfefaSdisz(enst[hNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp)()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~d Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r60o:u pnote: (field 'group' will be initialized after field 'stepSize'g roup), 562| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id(t i563d | ) , n tshtreepaSdisz(en(tnhcrcelaSdhsm)e,m .tciodmImn.BbluofcfkS(itzherse[aNdCICdLx_.PxR)O,T Og_rSoIuMpP(LgEr]o/uNpC)C,L _ S| T ^~~~~~~~~~~E PS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads)p,, tAildgIon,B lPorcokt(ot>h(r)e.arduInd(xw.ex));, g| r ^o up(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562o:f15(:T )warning: )initializer order does not match the declaration order [-Wreorder-ctor] { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,916 :n7t:h rnote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds(nt h916r | e a d s ) , ptriidmIsn(Bglroocukp(Ttihdr,e agdrIoduxp.Nxt)h,r egardosu,p (&grreocuvp,) ,& s e| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d , | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r gs-> s563e | n d b u fsft,e paSrigzse-(>nrcecclvSbhumfefm,. c o| m ^m .buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:s53[:N Cnote: Cin instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereL _PRO T202O | _ S I M P L E ] /RNuCnCWLo_rSkTEElPeSm/esnitz().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:(916w:e7):; note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^ 916 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp : 13 : 1p:r inote: min instantiation of member function 'RunWork, 3, 2>::run' requested heres (grou p13T | iIdM,P Lg_rCoOuLpLN_tFhUrNeCa(dAsl,l R&erdeuccve,, &CsOeLnLdN,E Ta_rCgHsA-I>Ns,e nSdIbMuPfLfE,, aMrigns,- >rrcecclv_bbufflfo,a t 1| 6 ^) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202::39153::95 :note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested herenote: expanded from macro 'IMPL_COLL_FUNC' 202 | 391 | R u nRWuonrWkop(<)t.yrpuen>(,w eN)C;C L _| A ^L GO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cppg:o9,: 1N:C Cnote: Lin instantiation of member function 'RunWork, 3, 2>::run' requested here_ PROT O9_ | #I#MpPrLo_tCoO>L(L)_.FrUuNnC((&AnlclcRleSdhumceem,. wCoOrLkL)N;E T\_ C H| A ^I N, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15,: Mnote: ifield 'nthreads' will be initialized after field 'tidInBlock'n , ui n562t | 6 4 _ t )t i d| (^t id), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnth:r391ea:d95s:( nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads )391, | t iRduInnWBolrokc ,note: field 'group' will be initialized after field 'stepSize'N CCL_A L562G | O _ # # atligdo(,t iNdC)C,L _nPtRhOrTeOa_d#s#(pnrtohtroe>a(d)s.)r,u nt(i&dnIcncBllSohcmke(mt.hwroerakd)I;d x\. x )| , ^ group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o15u:p )note: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppd:S1t: aIn file included from r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:B10c: aIn file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ht:,167 : n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:h562r:e15a:d swarning: Binitializer order does not match the declaration order [-Wreorder-ctor]c ast, &dire c562t | - > o u tt,i dd(itriedc)t,- >ndtohwrne,a dasr(gnst-h>rseeanddsb)u,f ft,i daIrngBsl-o>crke(ctvhbruefafd,I d x| . ^x ), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 202 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563R | u n W o rsktEelpeSmieznet(C(C)L._rPuRnO(TwOe_)S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;I: M562 P:| L15 ^E: ] /warning: Ninitializer order does not match the declaration order [-Wreorder-ctor]C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp C:L4_:S1 T:562E | Pnote: Sin instantiation of member function 'RunWork, 2, 2>::run' requested here / s it zi4ed | o(IftM(iPTdL))_),C O{nL tL h_| rF ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eU aN dC| s( group(group(A nltlhRreedaud/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hcs:e)626,,: 9Ct:Oi Ldnote: LIin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereNn EBTl_o Dc626Ik | R( Et Ch Tr ,e a Sd II MdpPxrL.iExm,)s ,(M tagixrd,o- utipin(dtgS8rt_oatur)pt )S ,c| a^ t | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ,:| 391 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n: T95h:r e563note: a | expanded from macro 'IMPL_COLL_FUNC'd s S c astt te391ep | rS ,i zRNeuU(nLnWLco,cr lkdSonumcpm#,.# bfauurfngfcsS,-i >ztseyespn[edN,bC uCFfLuf_n,Pc R#aO#rTdgOes_v-Sr>IerMdePocLpvE L, _ | SN ^TC ECPLS_/A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hsL:iG202zO:e_53o#:f# (anote: Tlin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here)g )o , { 202 N | C | C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L _ P | R group(groupO T OR_u#n#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hWp:or687ro:kt11Eo:l> e(note: m)in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree. nrtu< nF687(n | &, n cT c, l SR he md eO mp .,pw roAirlmkgs)o(;,t i\Pd r- ot| ti ^od >S(t)a.r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hrt:uB562nc:(a15ws:et ),note: ; field 'nthreads' will be initialized after field 'tidInBlock' n T| h ^r e562a | d s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp B: tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group ->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: In file included from initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10 : 562In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 167 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dwarning: )initializer order does not match the declaration order [-Wreorder-ctor], nthreads(nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx. x563) | , g r osutpe(pgSriozuep()n,c c l| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h m e| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). comm. b563u | f f S i zsetse[pNSCiCzLe_(PnRcOcTlOS_hSmIeMmP.LEc]o/mNmC.CbLu_fSfTSEiPzSe/ss[iNzCeCoLf_(PTR)O)T O{_ S I| M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P L E| ] group(group/ NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:f677(:T11):) note: {in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 677 group(group | prims(tid-ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:S626t:a9r:t Bnote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea st, nT h626r | e a d s B c a s tp,r i&mdsi(rteicdt--t>ioduStt,a rdtiSrceacttt-e>rd,o wnnT,h raeragdss-S>csaetntdebru,f fN,U LaLr,g sd-i>rreecctv-b>uufpf,, a r| g ^s ->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:,53 :a rnote: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ->r e202c | v b u f f , | R ^u nWorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:e202n:t53<:F nnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here T, R e202d | O p , A l g o ,R uPnrWootrok>E(l)e.mreunnt(, 2, 2>::run' requested hereo to>( )5. | rIuMnP(Lw_eC)O;L L _| F ^U NC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:d4u:c1e:, note: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereO LLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE :T4562_ | :DI15IM:RP ELwarning: C_initializer order does not match the declaration order [-Wreorder-ctor]TC ,O LSLI_M FP562UL | NE C, ( AM latlxiR,de (dutuiicnedt,)8 ,_C tOn)Lt Lh Nr| Ee^Ta _dDsI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(R:nE391tC:hT95r,:e aSnote: dIexpanded from macro 'IMPL_COLL_FUNC'sM )P,L E t,391i | dM Ia nxRB,ul noiWcnoktr(8kt_c ,#s #tNfeCupCnSLci_,zA eLt(GynOpc_ec#,l# SaFhlumgneocm,#. #cNdoCemCvmLr._ebPduRofOpfSTpN,rC oCNtLCo_C>PL(R_)OA.LTrGOuO_n_S(#I&#MnaPclLcgElo]S,/h NmNCeCCmCL.L_w_SoPTrREkOP)TS;O/ _s\#i #z pe| ro ^of t(oT>)(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h)): .562{r: u15 n:| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~¬e: nfield 'nthreads' will be initialized after field 'tidInBlock' c | c group(groupl S h562m | e m /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h. :w 641ot:ri11kd:)( ;tnote: iin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here\d ) ,| ^n641 t | h r e a d s ( n t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h h: r562pe:ra15id:ms s)note: (,tfield 'nthreads' will be initialized after field 'tidInBlock' i tdi-dtI in562dB | Sl to ac rk t(tRtiehddr(uetcaiedd,I) d,nx T.nhxtr)he,ra edgasrdRoseu(dpnu(tcgherr,oe uadpdi)sr,)e ,c t| t- ^~~~~~~~~~~~~~~~~i> ddIo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnw:Bn562l,:o 60c&:kd (inote: trfield 'group' will be initialized after field 'stepSize'he rceta -d562>I | od ux t. ,x )ta,ir dgg(srt-oi>udps)(e,gn rdnobtuuhpfr)fe,,a d as| r( ^~~~~~~~~~~~~~~~~gn st-h>r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hre:ea562cd:vs60b):u, f note: ftfield 'group' will be initialized after field 'stepSize',i dInBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]x .x), group(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here u p202) | , | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15I:d xwarning: .initializer order does not match the declaration order [-Wreorder-ctor]x ), group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~i d),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x )563, | g r o uspt(egprSoiuzpe)(,n c c| ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:5562 | :I15M:P Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]C OLL_FUNC( A562l | l R e d utcied,( tCiOdL)L,N EnTt_hDrIeRaEdCsT(,n tShIrMePaLdEs,) ,M atxi,d IuniBnlto8c_kt()t h r| e^a dIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:)391,: 95g:r onote: uexpanded from macro 'IMPL_COLL_FUNC'p (group )391, | | R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n W| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ks,[ NNCCCCLL__PARLOGTOO__#S#IaMlPgLoE,] /NNCCCCLL__PSRTOETPOS_/#s#ipzreootfo(>T())). r{u n (| & ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n c c| l group(groupS hmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:)677;: 11\: note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :677562 | : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' p r562i | m s ( t itdi-dt(itdiSdt)a,r tnBtcharsetad,s (nnTthhrreeaaddssB)c,a stti,d I&ndBilroecckt(-t>horueta,d Iddixr.exc)t,- >gdroowunp,( garrogusp-)>,s e n| d ^~~~~~~~~~~~~~~~~b uf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 :a60r:g snote: -field 'group' will be initialized after field 'stepSize'> recvEb,u fMfa,x , | i ^n t8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 391:95: 202note: | expanded from macro 'IMPL_COLL_FUNC' R u391n | W o rRkuEnlWeomrekn#(d)e.vrruend(owpe<)t;y p e| > ^, NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppG:O5_:#1#:a lnote: gin instantiation of member function 'RunWork, 2, 2>::run' requested hereo , NC C5L | _IPMRPOLT_OC_O#L#Lp_rFoUtNoC>((A)l.lrRuend(u&cnec,c lCSOhLmLeNmE.Tw_oDrIkR)E;C T\, S| I ^M PLE, Max, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:i562n:t158:_ tnote: )field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nth r391e | a d sR(unntWhorreka,, N| C ^~~~~~~~~~~~~~~~~C L_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:G562O:_60#:# anote: lfield 'group' will be initialized after field 'stepSize'g o, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdkI)n;B l\o c k| ( ^t hreadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(grou p562) | , | ^~~~~~~~~~~t id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562 | : 562 : 15t:i dwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t id), nthreads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~I dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC C562L | _ A L G Ot_i#d#(atligdo),, NnCtChLr_ePaRdOsT(On_t#h#rperaodtso)>,( )t.irduInn(B&lnoccckl(Sthhmreema.dwIodrxk.)x;) ,\ g r| o ^u p(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 15 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: field 'nthreads' will be initialized after field 'tidInBlock' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL)E,] /gNrCoCuLp_(SgTrEoPuSp/)s,i z e| o ^~~~~~~~~~~~~~~~~f (T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :{562 : 60| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize'| group(group 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,655 :n11t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds(nt h655r | e a d s ) , t i d IpnrBilmosc(kt(itdh-rteiaddSItdaxr.txR)e,d ugcreo,u pn(Tghrroeuapd)s,R e d| u ^~~~~~~~~~~c e, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:>562(:)15.:r uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]( &ncclShm e562m | . w o r kt)i;d (\t i d| ) ^, nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(15n:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e ads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l o c| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~~~~~~~o mm.bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:f562S:i60z:e snote: [field 'group' will be initialized after field 'stepSize'N CCL_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d666I:d9x:. xnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, grou p666( | g r o u p ) , p| r ^~~~~~~~~~~i ms(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().ru n562( | w e ) ; t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :n6t:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres (nt h6r | eIaMdPsL)_,C OtLiLd_IFnUBNlCo(cAkl(ltRherdeuacdeI,d xC.OxL)L,N EgTr_oDuIpR(EgCrTo,u pS)I,M P L| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, M| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x , int 35632 | _ t ) s| t^e pSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391c:c95l:S hnote: mexpanded from macro 'IMPL_COLL_FUNC'e m.comm .391b | uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hU:N562C:(15A:l lwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]e duce, COLL N562E | T _ D I RtEiCdT(,t iSdI)M,P LnEt,h rMeaaxd,s (unitnhtr8e_atd)s ) ,| ^t idInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:o391c:k95(:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e adIdx.x )391, | g rRouunpW(ogrrko.,c oNmCmC.Lb_uAfLfGSOi_z#e#sa[lNgCoC,L _NPCRCOLT_OP_RSOITMOP_L#E#]p/rNoCtCoL>_(S)T.ErPuSn/(s&inzcecolfS(hTm)e)m .{w o r| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | \ group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::666562::915:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 666 | 562 | t i dp(rtiimds)(,t indt,h rneTahdrse(andtshGraetahdesr),, dtiirdeIcntB-l>oucpk,( tNhUrLeLa,d Iadrxg.sx-)>,s egnrdobuupf(fg,r oaurpg)s,- > r| e ^~~~~~~~~~~~~~~~~c vbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 : 60| : ^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :562202 | : 53 : note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d(ti d202) | , n t h r e a dRsu(nnWtohrrkeEaldesm)e,n tto(u)p.(rgurno(uwpe)),; | | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562b:u15f:f Swarning: iinitializer order does not match the declaration order [-Wreorder-ctor]z es[NCCL_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)626,: 9g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (group )626, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) prims (563t | i d - t isdtSetpaSritzSec(antctcelrS,h mneTmh.rceoamdms.SbcuaftftSeirz,e sN[UNLCLC,L _dPiRrOeTctO-_>SuIpM,P LaEr]g/sN-C>CsLe_nSdTbEuPfSf/,s iazregosf-(>Tr)e)c v{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::641202:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 641 | 202 | p rRiumnsW(otrikdE-lteimdeSnttad(i)r.ercutn-(>wdeo)w;n , | & ^d irect->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppo:u7t:,1 :a rnote: gin instantiation of member function 'RunWork, 2, 2>::run' requested heres ->se n7d | bIuMfPfL,_ CaOrLgLs_-F>UrNeCc(vAblulfRfe,d u c| e ^, COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:D202I:R53E:C Tnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here SIMP L202E | , M a x , u iRnutn3W2o_rtk)E l e| m^e ntW(o)r.kr, 2, 2>::run' requested heree vredop <6t | yIpMeP>L,_ CNOCLCLL__FAULNGCO(_A#l#laRlegdou,c eN,C CCLO_LPLRNOETTO__D#I#RpErCoTt,o >S(I)M.PrLuEn,( &Mnacxc,l Sihnmte3m2._wto)r k )| ;^ \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :expanded from macro 'IMPL_COLL_FUNC'562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 391 | R u562n | W o r k h,r eNaCdCILd_xA.LxG)O,_ #g#raolugpo(,g rNoCuCpL)_,P R O| T ^~~~~~~~~~~~~~~~~O _##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562o:t60o:> (note: )field 'group' will be initialized after field 'stepSize'. run(& n562c | c l S h tid(tid), nmtehmr.ewaodrsk()n;t h\r e a| d ^s ), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eads(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~e adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60562: | note: field 'group' will be initialized after field 'stepSize' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_15#:# pwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o to>().ru n562( | & n c c ltSihdm(etmi.dw)o,r kn)t;h r\e a d| s ^( nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~s [NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R60O:T Onote: _field 'group' will be initialized after field 'stepSize'S IMPLE] /562N | C C L _ StTiEdP(St/isdi)z,e onft(hTr)e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:B641l:o11c:k (note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh readId x641. | x ) , g r o u p ( gprroiumps)(,t i d| - ^~~~~~~~~~~t idStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:l562e:m15e:n twarning: t(h)r.eraudns((wnet)h;r e a| d ^s ), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppn:B7l:o1c:k (note: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read I7d | xI.MxP)L,_ CgOrLoLu_pF(UgNrCo(uApl)l,R e d| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c e ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C OLLNE T563_ | D I R E CsTt,e pSSIiMzPeL(En,c cMlaSxh,m eumi.ncto3m2m_.tb)u f f| S^i zes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C391C:L95_:P Rnote: Oexpanded from macro 'IMPL_COLL_FUNC'T O_SIMP L391E | ] / NRCuCnLW_oSrTkE, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep e>, N C655C | L _ A L G O _ # # a lpgroi,m sN(CtCiLd_-PtRiOdTSOt_a#r#tpRreodtuoc>e(,) .nrTuhnr(e&andcscRleSduchem,e mn.uwlolrpkt)r;, \& d i| r ^e ct->o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:t562,: 15a:r gnote: sfield 'nthreads' will be initialized after field 'tidInBlock'- >send b562u | f f , atrigds(-t>irde)c,v bnutfhfr,e a d| s ^( nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :t53i:d Inote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereB lock( t202h | r e a d I d x . xR)u,n WgorrokuEpl(egmreonutp<)F,n , | T ^~~~~~~~~~~~~~~~~, Re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:,60 :A lnote: gfield 'group' will be initialized after field 'stepSize'o , Pro t562o | > ( ) . rtuind((wtei)d;) , | n ^t hreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppn:t7h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here) , ti d7I | nIBMlPoLc_kC(OtLhLr_eFaUdNICd(xA.lxl)R,e dgurcoeu,p (CgOrLoLuNpE)T,_ D I| R ^~~~~~~~~~~E CT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO_:#562#:p15r:o twarning: oinitializer order does not match the declaration order [-Wreorder-ctor]> ().run(&nc c562l | S h m e mt.iwdo(rtki)d;) ,\ n t| h ^r eads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ), ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~[ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O60T:O _note: Sfield 'group' will be initialized after field 'stepSize'I MPLE] /562N | C C L _ StTiEdP(St/isdi)z,e onft(hTr)e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:k666(:t9h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered Idx.x )666, | g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h\: 562 :| 15 ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid), n t562h | r e a d st(indt(htrieda)d,s) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~p Size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562c:c60l:S hnote: mfield 'group' will be initialized after field 'stepSize'e m.comm. b562u | f f S i zteisd[(NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt]h/rNeCaCdLs_)S,T EtPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u677p:)11,: note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hD:I562R:E15C:T ,warning: initializer order does not match the declaration order [-Wreorder-ctor]S IMPLE, M562a | x , i ntti3d2(_tti)d ) ,| ^n threads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC') , tidIn B391l | o c kR(utnhWroerakdp,S iN zC562 | eC (L n_ cAtcLilGdSO(h_tm#ie#dma).l,cg oonm,tm h.NrbCeuCafLdf_sSP(iRnzOteThsOr[_eN#aC#dCspL)r_,oP tRtoOi>Td(OI)_n.SBrIluMonPc(Lk&E(n]tc/hcNrlCeSCahLdm_IeSdmTx.E.wPxoS)r/,ks )ig;zr eo\ou fp ((| Tg ^)r )o u{p ) ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 | : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~15| : group(group | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 563641562 | : | 11 : note: sin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herett iedp(Sti iz641de | )( ,n c nc tl hS rh em ae dm s.p(crnoitmmhmsr.(ebtauidfdsf-)St,ii zdteSistd[aINrnCtBCRlLeo_dcPukRc(OetT,hO r_neSTahIdrMIePdaLxdE.s]Rx/e)Nd,Cu CcgLer_,oS uTdpEi(PrgSer/costui-pz>)ed,oo fw (n| T, ^~~~~~~~~~~~~~~~~) )& d{i /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr :e| 562c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:t 60- :>| o group(groupnote: u field 'group' will be initialized after field 'stepSize't , args -562> | s e n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h d: b687tu:if11df:(, t note: iain instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heredr )g,s -n>t rh687er | ce va bd us f( fn ,t h r | e ^ap drsi)m,s (tt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hii:dd202I-:nt53Bi:ld oSnote: ctin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereka (rtthB rc202ea | as dt I, d xn .T xh )r ,eR augdnrsWoBoucrpak(sEgtlr,eo mu&epdn)it,r< eF cn| t, ^~~~~~~~~~~- >To,u tR,e dnOupl,l pAtlrg,o ,a rPgrso-t>os>e(n)d.bruufnf(,w ea)r;g s -| ^ >recvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:f8f:,1 : | note: ^in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P202L:_53C:O Lnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ FUNC( A202l | l R e d u c e , RCuOnLWLoNrEkTE_lDeImReEnCtT<,F nS,I MTP,L ER,e dMOapx,, Ailngto6,4 _Ptr)o t o| >^( ).ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(391w:e95):; note: expanded from macro 'IMPL_COLL_FUNC'| ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppR:u6n:W1o:r knote: , 2, 2>::run' requested heren cclFu n6c | #I#MfPuLn_cC,O LtLy_pFeU,N CF(uAnlcl#R#eddeuvcree,d oCpO_,D INRCECCLT_,A LSGIOM_P#L#Ea,l gMoa,x ,N CiCnLt_3P2R_OtT)O _ #| #^p roto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:)391.:r95u:n (note: &expanded from macro 'IMPL_COLL_FUNC'n cclShme m391. | w o rRku)n;W o\r k <| n ^c clFu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:c562#:#15f:u nnote: cfield 'nthreads' will be initialized after field 'tidInBlock', type ,562 | F u n c #t#idde(vtriedd)o,p a,d sN(CnCtLh_rAeLaGdOs_)#,# atligdoI,n BNlCoCcLk_(PtRhOrTeOa_d#I#dpxr.oxt)o,> (g)r.oruupn((g&rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~. wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;60 :\ note: field 'group' will be initialized after field 'stepSize'| ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15p:r iwarning: minitializer order does not match the declaration order [-Wreorder-ctor]s (tid, nTh r562e | a d s G attihde(rt,i dd)i,r enctth-r>euapd,s (NnUtLhLr,e aadrsg)s,- >tsiednIdnbBulfofc,k (atrhgrse-a>drIedcxv.bxu)f,f ,g r o| u ^p (group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~202 : 53| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 563 | 202 | s t e p S i zReu(nnWcocrlkSEhlmeemme.nctoM(P)L.Er]u/nN(CwCeL)_;S T E| P ^S /sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppT:)7): 1{: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 7 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:C677O:L11L:_ Fnote: Uin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereN C(All R677e | d u c e , C O L L NpErTi_mDsI(RtEiCdT-,t iSdISMtPaLrEt,B cMaasxt,, uniTnhtr3e2a_dts)B c a| s^t , &di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391c:t95-:> onote: uexpanded from macro 'IMPL_COLL_FUNC't , direc t391- | > d oRwunn,W oarrkgcsleFnudnbcu#f#ff,u nacr,g st-y>pree,c vFbuunfcf#,# d e| v ^r edop/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h<:t202y:p53e:> ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereN CCL _202A | L G O _ # # a l gRou,n WNoCrCkLE_lPeRmOeTnOt_<#F#np,r oTt,o >R(e)d.Orpu,n (A&lngcoc,l SPhrmoetmo.>w(o)r.kr)u;n (\w e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp::6:151:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'RunWork, 2, 2>::run' requested here 6562 | | I M P L _tCiOdL(Lt_iFdU)N,C (nAtlhlrReeaddusc(en,t hCrOeLaLdNsE)T,_ DtIiRdEICnTB,l oScIkM(PtLhEr,e aMdaIxd,x .ixn)t,3 2g_rto)u p (| g^r oup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 : 95| : ^~~~~~~~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60 :391 | note: field 'group' will be initialized after field 'stepSize' RunWo r562k | < n c c ltFiudn(ct#i#df)u,n cn,t htryepaed,s (Fnutnhcr#e#addesv)r,e dtoipdc,k (NtChCrLe_aAdLIGdOx_.#x#)a,l ggor,o uNpC(CgLr_oPuRpO)T,O _ #| # ^~~~~~~~~~~p roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:D562I:R15E:C Twarning: ,initializer order does not match the declaration order [-Wreorder-ctor] SIMPLE, M562a | x , i ntti6d4(_tti)d ) ,| ^n thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391s:(95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads), 391t | i d IRnuBnlWoocrkk( | , N C CsLt_eApLSGiOz_e#(#naclcgloS,h mNeCmC.Lc_oPmRmO.TbOu_f#f#Spirzoetso[>N(C)C.Lr_uPnR(O&TnOc_cSlISMhPmLeEm]./wNoCrCkL)_;S T\E P S| / ^s izeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:(562T:)15): {note: field 'nthreads' will be initialized after field 'tidInBlock' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t655i:d11):, note: nin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hread s655( | n t h r e a d s ) , ptriidmIsn(Btliodc-kt(itdhSrteaardtIRdexd.uxc)e,, gnrTohurpe(agdrsoRuepd)u,c e | , ^~~~~~~~~~~~~~~~~ null/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:t562r:,60 :& dnote: ifield 'group' will be initialized after field 'stepSize'r ect->o u562t | , a r gtsi-d>(steindd)b,u fnft,h raeragdss-(>nrtehcrvebaudfsf),, t| i ^d InBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t202h:r53e:a dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered x.x) ,202 | g r o u p ( g r oRuupn)W,o r k| E ^~~~~~~~~~~l ement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->do/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:n562,: 15&:d iwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ct->out, 562a | r g s - >tsiedn(dtbiudf)f,, natrhgrse-a>drse(cnvtbhurfefa,d s )| , ^ tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c202k:(53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea dIdx .202x | ) , g r o u p (RgurnoWuopr)k,E l e| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e n t| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)F n, T, 563R | e d O p ,s tAelpgSoi,z eP(rnoctcol>S(h)m.ermu.nc(owmem).;b u f| f ^S izes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppN:C6C:L1_:P Rnote: Oin instantiation of member function 'RunWork, 2, 2>::run' requested hereT O_S I6M | PILMEP]L/_NCCOCLLL__SFTUENPCS(/AslilzReeodfu(cTe),) C{O L L| N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E T _| D group(groupI RECT, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hM:P666L:E9,: Mnote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex , in t6663 | 2 _ t ) | ^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(391t:i95d:, note: nexpanded from macro 'IMPL_COLL_FUNC'T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391::95562:: 15note: :expanded from macro 'IMPL_COLL_FUNC' warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | R562u | n W o r ktr,e aNdCICdLx_.AxL)G,O _g#r#oaulpg(og,r oNuCpC)L,_ P R| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T O _| # tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# prot o563> | ( ) . r usnt(e&pnSciczleS(hnmcecml.Swhomrekm).;c o\m m .| b ^u ffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:[562N:C15C:L _note: Pfield 'nthreads' will be initialized after field 'tidInBlock'R OTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d666I:d9x:. xnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, gro u666p | ( g r o u p ) , p r| i ^~~~~~~~~~~~~~~~~m s(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562,: 60n:T hnote: rfield 'group' will be initialized after field 'stepSize'e adsG a562t | h e r , tdiidr(etcitd-)>,u pn,t hNrUeLaLd,s (anrtghsr-e>asdesn)d,b utfifd,I naBrlgosc-k>(rtehcrvebaudfIfd,x . x| ) ^, group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:g202r:o53u:p )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391: 95391: | note: expanded from macro 'IMPL_COLL_FUNC'R unWork <391n | c c lRFuunnWco#r#kfv,r eNdCoCpL<_tAyLpGeO>_,# #NaClCgLo_,A LNGCOC_L#_#PaRlOgToO,_ #N#CpCrLo_tPoR>O(T)O._r#u#np(r&ontcoc>l(S)h.mreumn.(w&onrckc)l;S h\m e m| . ^w ork); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: 562field 'nthreads' will be initialized after field 'tidInBlock' | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~u p),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~~~~~~~60 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :field 'group' will be initialized after field 'stepSize'562 :60: note: field 'group' will be initialized after field 'stepSize'562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::60562:: 15note: :field 'group' will be initialized after field 'stepSize' warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t562i | d ( t i dt)i,d (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:e562):;15 : | warning: ^initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6 :5621 | : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid( t6i | dI)M,P Ln_tChOrLeLa_dFsU(NnCt(hArlelaRdesd)u,c et,i dCIOnLBLlNoEcTk_(DtIhRrEeCaTd,I dSxI.MxP)L,E ,g rMoauxp, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562u:p15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->send b562u | f f , atrigds(-t>irde)c,v bnutfhfr,e a d| s ^( nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :t53i:d Inote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereB lock (202t | h r e a d I d x .Rxu)n,W ogrrkoEulpe(mgernotup(S)i.zreu(nn(cwcel)S;h m e| m ^. comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:f7f:S1i:z enote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here[ NCCL _7P | RIOMTPOL__SCIOMLPLL_EF]U/NNCC(CALl_lSRTeEdPuSc/es,i zCeOoLfL(NTE)T)_ D{I R E| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T , | S group(groupI MPLE, M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:x626,: 9u:i nnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here3 2_t) 626| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391p:r95i:m snote: (expanded from macro 'IMPL_COLL_FUNC't id-tid S391t | a r tRSucnaWtotrekr<,n cncTlhFruenacd#s#Sfcuantct,e rt,y pNeU,L LF,u ndci#r#edcetv-r>eudpo,p >,s eNnCdCbLu_fAfL,G Oa_r#g#sa-l>groe,c vNbCuCfLf_,P R O| T ^O _##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:>202(:)53.:r unote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( &ncc l202S | h m e m . w o r kR)u;n W\o r k| E ^l ementi(d)).,r unnt(hwree)a;d s (| n ^t hreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp):,6 :t1i:d Inote: nin instantiation of member function 'RunWork, 2, 2>::run' requested hereB lock (6t | hIrMePaLd_ICdOxL.Lx_)F,U NgCr(oAulpl(Rgerdouucpe),, C O| L ^~~~~~~~~~~~~~~~~L NET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:D562I:R60E:C Tnote: ,field 'group' will be initialized after field 'stepSize' SIMPL E562, | M a x ,t iidn(tt3i2d_)t,) n t| h^r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's ), tid I391n | B l oRcukn(Wtohrrke, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthhreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:e562):;15 : | warning: ^initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8 :5621 | : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid( t8i | dI)M,P Ln_tChOrLeLa_dFsU(NnCt(hArlelaRdesd)u,c et,i dCIOnLBLlNoEcTk_(DtIhRrEeCaTd,I dSxI.MxP)L,E ,g rMoauxp,( girnotu6p4)_,t ) | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :56395 | : note: expanded from macro 'IMPL_COLL_FUNC' stepSi z391e | ( n cRculnSWhomrekm<.nccocmlmF.ubnucf#f#Sfiuznecs,[ NtCyCpLe_,P RFOuTnOc_#S#IdMePvLrEe]d/oNpCE,P SN/CsCiLz_eAoLfG(OT_)#)# a{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group lgo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:L687_:P11R:O Tnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ ##pro t687o | > ( ) . r u n ( & n cpcrliSmhsm(etmi.dw-otrikd)S;t a\r t B| c ^a st, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:h562r:e15a:d snote: Bfield 'nthreads' will be initialized after field 'tidInBlock'c ast, &562d | i r e c tt-i>do(utti,d )n,u lnltphtrre,a dasr(gnst-h>rseeanddsb)u,f ft,i daIrngBsl-o>crke(ctvhbruefafd,I d x| . ^x ), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : Rnote: ufield 'group' will be initialized after field 'stepSize'n WorkEl e562m | e n t < Ftni,d (Tt,i dR)e,d Onpt,h rAelagdos,( nPtrhorteoa>d(s)).,r utni(dwIen)B;l o c| k ^( thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppI:d7x:.1x:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereg roup (7g | rIoMuPpL)_,C O L| L ^~~~~~~~~~~_ FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hLLNET:_562D:I15R:E Cwarning: Tinitializer order does not match the declaration order [-Wreorder-ctor], SIMPLE, Max ,562 | u i n t 6t4i_dt()t i d| )^, nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95s:( nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads) ,391 | t i dRIunnBWloorckk<(ntchcrleFaudnIcd#x#.fxu)n,c ,g rtoyuppe(,g rFouunpc)#,# d e| v ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e d| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p | , N C CsLt_eApLSGiOz_e#(#naclcgloS,h mNeCmC.Lc_oPmRmO.TbOu_f#f#Spirzoetso[>N(C)C.Lr_uPnR(O&TnOc_cSlISMhPmLeEm]./wNoCrCkL)_;S T\E P S| / ^s izeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:(562T:)15): {note: field 'nthreads' will be initialized after field 'tidInBlock' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666n:t9h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s(nth r666e | a d s ) , t i dpIrniBmlso(ctki(dt,h rneTahdrIedaxd.sxG)a,t hgerro,u pd(igrreocutp-)>,u p ,| ^~~~~~~~~~~~~~~~~N ULL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r60g:s -note: >field 'group' will be initialized after field 'stepSize's endb u562f | f , a rtgisd-(>triedc)v,b unftfh,r e a| d ^s (nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Bloc k202( | t h r e a d I d xR.uxn)W,o rgkrEoluepm(egnrto().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 15 :| group(groupwarning: initializer order does not match the declaration order [-Wreorder-ctor] 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 677 : 11 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), n677t | h r e a d s ( n t h rperaidmss)(,t itdi-dtIindBSltoacrkt(Btcharseta,d IndTxh.rxe)a,d sgBrcoauspt(,g r&oduipr)e,c t -| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u t| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) dire c563t | - > d o wsnt,e paSrigzse-(>nscecnldSbhumfefm,. caormgms.-b>urfefcSvibzuefsf[,N C C| L ^_ PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I202M:P53L:E ]note: /in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereN CCL _202S | T E P S / s i z eRoufn(WTo)r)k E{l e m| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t <| F group(groupn , T, RedOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666A:l9g:o ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP roto>( )666. | r u n ( w e ) ; p r| i ^m s(tid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :n9T:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres Gath e9r | ,I MdPiLr_eCcOtL-L>_uFpU,N CN(UALlLl,R eadrugcse-,> sCeOnLdLbNuEfTf_,D IaRrEgCsT-,> rSeIcMvPbLuEf,f ,M a x| , ^ uint64_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 202 :| 53^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :20295 | : note: expanded from macro 'IMPL_COLL_FUNC' Ru n391W | o r kREulneWmoernkt<#(#)d.ervurne(dwoep)<;t y p| e ^> , NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp_:A9L:G1O:_ #note: #in instantiation of member function 'RunWork, 2, 2>::run' requested herea lgo, 9N | CICMLP_LP_RCOOTLOL__#F#UpNrCo(tAol>l(R)e.druucne(,& nCcOcLlLSNhEmTe_mD.IwRoErCkT),; S\I M P| L ^E , Max,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:15n:t 6note: 4field 'nthreads' will be initialized after field 'tidInBlock'_ t) | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95(:t inote: dexpanded from macro 'IMPL_COLL_FUNC') , nthre a391d | s ( nRtuhnrWeoardks<)n,c ctliFduInncB#l#ofcukn(ct,h rteyapdeI,d xF.uxn)c,# #gdreovurpe(dgorpo ,| ^~~~~~~~~~~~~~~~~N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdkI)n;B l\o c k| ( ^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~d ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562(:A15l:l Rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]d uce, COL L562N | E T _ D ItRiEdC(Tt,i dS)I,M PnLtEh,r eMaadxs,( nutihnrte6a4d_st)), t| i^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k391(:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Idx.x )391, | g rRouunpW(ogrrko.,c oNmCmC.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), L| ^~~~~~~~~~~~~~~~~ _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdkI)n;B l\o c k| ( ^t hreadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pnote: (field 'nthreads' will be initialized after field 'tidInBlock'g roup), 562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562:15:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid( t562i | d ) , ntthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), | 563 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) stepS i563z | e ( n c csltSehpmSeimz.ec(onmcmc.lbSuhfmfeSmi.zceosm[mN.CbCuLf_fPSRiOzTeOs_[SNICMCPLL_EP]R/ONTCOC_LS_ISMTPELPES]//sNiCzCeLo_fS(TTE)P)S /{s i z| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o f (| T group(group) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641| : group(group11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655 : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here prims (655t | i d - t i d S t a r tpRreidmusc(et,i dn-TthirdeSatdasrRteRdeudcuec,e ,d inrTehcrte-a>ddsoRwend,u c&ed,i rneucltl-p>toru,t ,& dairrgesc-t>-s>eonudtb,u fafr,g sa-r>gsse-n>drbeucfvfb,u fafr,g s -| > ^r ecvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202: | 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here Run W202o | r k E l e m e n tR,( )A.lrguon,( wPer)o;t o >| ( ^) .run(we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 8| : ^1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp: 98: | 1I:M Pnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested here_ COLL _9F | UINMCP(LA_lClORLeLd_uFcUeN,C (CAOlLlLRNeEdTu_cDeI,R ECCOTL,L NSEITM_PDLIER,E CMTa,x ,S IiMnPtL6E4,_ tM)a x ,| ^u int64/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:t391): 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :391 | note: expanded from macro 'IMPL_COLL_FUNC' RunWo r391k | < n cRculnFWuonrck#<#nfcucnlcF,u ntcy#p#ef,u nFcu,n ct#y#pdee,v rFeudnocp#<#tdyepver>e,d oNpCG,O _N#C#CaLl_gAoL,G ON_C#C#La_lPgRoO,T ON_C#C#Lp_rPoRtOoT>O(_)#.#rpurno(t&on>c(c)l.Srhumne(m&.nwcocrlkS)h;m e\m . w| o ^r k); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 15: note: field 'nthreads' will be initialized after field 'tidInBlock'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: 562field 'nthreads' will be initialized after field 'tidInBlock' | t562i | d ( t i dt)i,d (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~~~~~~~p ), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~: 562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'group' will be initialized after field 'stepSize'60 : note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: warning: | initializer order does not match the declaration order [-Wreorder-ctor] group(group 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t687i:d11):, note: nin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads (687n | t h r e a d s ) , tpirdiImnsB(ltoicdk-(ttihdrSetaadrItdBxc.axs)t,, gnrTohurpe(agdrsoBucpa)s,t , | & ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d i r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c t->ou t563, | n u l lsptterp,S iazreg(sn-c>csleSnhdmbeumf.fc,o mamr.gbsu-f>frSeiczvebsu[fNfC,C L _| P ^R OTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:L202E:]53/:N Cnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL _STE P202S | / s i z e o f ( TR)u)n W{o r k| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l e m| e group(groupn t, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereA lgo, P r677o | t o > ( ) . /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr :u 562n :( 15w:pe r)warning: i;initializer order does not match the declaration order [-Wreorder-ctor]m s (| t ^i d-tid S562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppt | :a 7r :t 1 B:tc iadnote: s(in instantiation of member function 'RunWork, 2, 2>::run' requested herett i,d ) n,7T | hnIrtMhePraLeda_sdCBsOc(LanLst_thFr,eU aNd&Csd)(i,A rtleilcdRtIen-Bdl>ouocckue(tt,,h r CedOaidLrILdeNxcE.tTx-)_>,Dd IgoRrwEonCu,Tp ,(a grrSgoIsuM-pP>)L,sE e, n | dM ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ba ux f,| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) , u ianrtg3s 2563-_ | >t r) e c v| sbt^eup fSfi,z /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he (:n| c391c ^l:S h95m:e m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: .:expanded from macro 'IMPL_COLL_FUNC'c202o m:m53.:b u note: f391in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref | S i z Re202us | n[ WN oC rCL_PROT kO _666o,:>9 (:N) C.note: Crin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL u_nA(LwGeO )666_; | # # a | l ^g o , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpppN:rC8iC:mL1s_(:Pt iRnote: dOin instantiation of member function 'RunWork, 2, 2>::run' requested here,T On_T #h8#r | peIraMdoPstGLoa_t>Ch(Oe)Lr.L,r_ uFdniU(rN&eCnc(ctAc-ll>luSRphe,md eNuUmLcL.e,w, oa rrCkgO)sL;-L> Ns\Ee nT d_| bD ^uIf RfE,C Ta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,r: g562sS:-I15>Mr:ePc Lvnote: Ebfield 'nthreads' will be initialized after field 'tidInBlock',u fMfa, x 562, | | i ^ n t 6t4i_dt()t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :i 202d| :)^53, : n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: t:in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh391 r:e95a:d s note: (202expanded from macro 'IMPL_COLL_FUNC'n | t h r e a d 391s | ) R, u RntuWinodrWIkoEnrlBkelm#(u#)p.d)reu,nv (rw ee| d) ^~~~~~~~~~~~~~~~~o; p :, note: Nfield 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppC :C7L:_ 1A562:L | Gnote: in instantiation of member function 'RunWork, 2, 2>::run' requested hereO _ # #tai ld7g( | otI,iM dPNL)C_,CC LOnL_tLPh_RrFOeUTaNOdC_s(#A(#lnlptRrehodrtueoca>ed(,s) ).C,rO uLtLniN(Ed&TI_nnDcBIclRlEoSCcTh,km (SetImMh.PrwLeoEar,d kIM)ad;xx ,. \xu i)n ,t| 3 ^2g _rto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h)u: p562 (:| g15^r: o unote: pfield 'nthreads' will be initialized after field 'tidInBlock') /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 56295 ^~~~~~~~~~~: | note: expanded from macro 'IMPL_COLL_FUNC' tid(ti d391) | , nRtuhnrWeoardks<(nnctchlrFeuandcs#)#,f utnicd,I ntBylpoec,k (Ftuhnrce#a#ddIedvxr.exd)o,p (,g rNoCuCpL)_,A L G| O ^~~~~~~~~~~~~~~~~_ ##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:g562o:,60 :N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _PROT O562_ | # # p r ottiod>((t)i.dr)u,n (n&tnhcrcelaSdhsm(enmt.hwroerakd)s;) ,\ t i| d ^I nBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock'I dx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~h reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: pinitializer order does not match the declaration order [-Wreorder-ctor]r ims(tid- t562i | d S t a rttiRde(dtuicde),, nnTthhrreeaaddssR(endtuhcree,a ddsi)r,e ctti-d>IdnoBwlno,c k&(dtihrreecatd-I>doxu.tx,) ,a rggrso-u>ps(egnrdobuupf)f,, a| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g s -| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ecvbuf f563, | | ^ stepSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:(202n:c53c:l Snote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem em.c o202m | m . b u f f S i zReusn[WNoCrCkLE_lPeRmOeTnOt_T())). r{u n (| w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e ) ;| group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp::68710::111:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 687I | M P L _ C O L L _ F UpNrCi(mAsl(ltRiedd-utcied,S tCaOrLtLBNcEaTs_tD,I RnETChTr,e aSdIsMBPcLaEs,t ,M a&xd,i rheacltf-)> o u| t^, nul/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:p391t:r95,: anote: rexpanded from macro 'IMPL_COLL_FUNC'g s->se n391d | b u fRfu,n Waorrgks<-n>crcelcFvubnucf#f#,f u n| c ^, type, Fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c202#:#53d:e vnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree dop , N C C L _RAuLnGWOo_r#k#Eallegmoe,n tNo(,) .Prruont(o&>n(c)c.lrSuhnm(ewme.)w;o r k| ) ^; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::8562::115:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 8562 | | I M P L _tCiOdL(Lt_iFdU)N,C (nAtlhlrReeaddusc(en,t hCrOeLaLdNsE)T,_ DtIiRdEICnTB,l oScIkM(PtLhEr,e aMdaIxd,x .ixn)t,6 4g_rto)u p (| g^r oup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~~~~~~~: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hexpanded from macro 'IMPL_COLL_FUNC': 562:60: note: field 'group' will be initialized after field 'stepSize' 391 | R562u | n W o r kt, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' _ 562 | # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B lock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(group )562, | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:_562D:I15R:E Cwarning: Tinitializer order does not match the declaration order [-Wreorder-ctor], SIMPLE, M562a | x , h atlifd)( t i| d^) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s (note: nexpanded from macro 'IMPL_COLL_FUNC't hreads) ,391 | t i dRIunnBWloorckk<(ntchcrleFaudnIcd#x#.fxu)n,c ,g rtoyuppe(,g rFouunpc)#,# d e| v ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e d| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p 563, | N C C Ls_tAeLpGSOi_z#e#(anlcgcol,S hNmCeCmL._cPoRmOmT.Ob_u#f#fpSriozteos>[(N)C.CrLu_nP(R&OnTcOc_lSSIhMmPeLmE.]w/oNrCkC)L;_ S\T E P| S ^/ sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)15 :{ note: field 'nthreads' will be initialized after field 'tidInBlock'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d626(:t9i:d )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nthrea d626s | ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteSacdaItdtxe.rx,) ,n Tghrroeuapd(sgSrcoautpt)e,r , | N ^~~~~~~~~~~~~~~~~U LL,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :d562i:r60e:c tnote: -field 'group' will be initialized after field 'stepSize'> up, a r562g | s - > s etniddb(utfifd,) ,a rngtsh-r>eraedcsv(bnutfhfr,e a d| s ^) , tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:n202B:l53o:c knote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202I | d x . x ) , g rRouunpW(ogrrkoEulpe)m,e n t| < ^~~~~~~~~~~F n, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groulFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d (NtCiCdL)_,A LnGtOh_r#e#aadlsg(on,t hNrCeCaLd_sP)R,O TtOi_d#I#npBrlootcok>((t)h.rreuand(I&dnxc.cxl)S,h mgermo.uwpo(rgkr)o;u p\) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :563 | note: field 'nthreads' will be initialized after field 'tidInBlock' ste p562S | i z e ( ntcicdl(Sthimde)m,. cnotmhmr.ebaudfsf(Snitzherse[aNdCsC)L,_ PtRiOdTIOn_BSlIoMcPkL(Et]h/rNeCaCdLI_dSxT.ExP)S,/ sgirzoeuopf((gTr)o)u p{) , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 641562: | 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id(ti d641) | , n t h r e a d s (pnrtihmrse(atdisd)-,t itdiSdtIanrBtlRoecdku(cteh,r enaTdhIrdexa.dxs)R,e dgurcoeu,p (dgirroeucpt)-,> d o| w ^~~~~~~~~~~n , &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork:,562 :N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]A LGO_##algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrcecaldSsh)m,e mt.iwdoIrnkB)l;o c\k ( t| h ^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(gro u562p | ) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d), n563t | h r e a dsst(enptShirzeea(dnsc)c,l SthimdeImn.Bcloomcmk.(btuhfrfeSaidzIedsx[.NxC)C,L _gPrRoOuTpO(_gSrIoMuPpL)E,] / N| C ^~~~~~~~~~~~~~~~~C L_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:P562S:/60s:i znote: efield 'group' will be initialized after field 'stepSize'o f(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d626s:(9n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads) ,626 | t i d I n B l o cpkr(itmhsr(etaiddI-dtxi.dxS)t,a rgtrSocuapt(tgerro,u pn)T,h r e| a ^~~~~~~~~~~d sScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhmem.w:o562r:k15):; warning: \ initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 60 : note: sfield 'group' will be initialized after field 'stepSize't epSi z562e | ( n c c ltSihdm(etmi.dc)o,m mn.tbhurfefaSdisz(enst[hNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp)()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O15_:# #warning: ainitializer order does not match the declaration order [-Wreorder-ctor]l go, NCCL _562P | R O T O _t#i#dp(roto>t(i)d.)r,u nn(t&hnrcecaldSsh(mnetmh.rweoardks));, \t i d| I ^n Block(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Inote: dfield 'nthreads' will be initialized after field 'tidInBlock'x .x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads( n563t | h r e a dsst)e,p StiizdeI(nnBclcolcSkh(mtehmr.ecaodmImd.xb.uxf)f,S igzreosu[pN(CgCrLo_uPpR)O,T O _| S ^~~~~~~~~~~~~~~~~I MPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h]:/562N:C60C:L _note: Sfield 'group' will be initialized after field 'stepSize'T EPS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r641e:a11d:s )note: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidIn B641l | o c k ( t h r e a d Ipdrxi.mxs)(,t igdr-otuipd(SgtraorutpR)e,d u c| e ^~~~~~~~~~~, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:c562v:b15u:f fwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :t53i:d (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d), n t202h | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBr(o)u.pr)u,n ( w| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ; | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 10 : 1s:t enote: pin instantiation of member function 'RunWork, 2, 2>::run' requested hereS ize( n10c | cIlMSPhLm_eCmO.LcLo_mFmU.NbCu(fAflSliRzeedsu[cNeC,C LC_OPLRLONTEOT__SDIIMRPELCET],/ NSCICMLP_LSET,E PMSa/xs,i zheaolff()T ) )| ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 group(group: 95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677 :39111 | : note: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu nWork< n677c | c l F u n c # # f u npcr,i mtsy(ptei,d -FtuindcS#t#adretvBrceadsotp,< tnyTpher>e,a dNsCBCcLa_sAtL,G O&_d#i#raelcgto-,> oNuCtC,L _dPiRrOeTcOt_-#>#dporwont,o >a(r)g.sr-u>ns(e&nndcbculfSfh,m eamr.gwso-r>kr)e;c v\b u f| f ^, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :field 'nthreads' will be initialized after field 'tidInBlock' 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i d ) ,R unntWhorrekaEdlse(mnetnhtrx(.)x.)r,u ng(rwoeu)p;( g r| o ^u p), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp| : ^~~~~~~~~~~~~~~~~11 :1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWork, 2, 2>::run' requested here: 60: note: field 'group' will be initialized after field 'stepSize'11 | IMPL _562C | O L L _ FtUiNdC((tAildl)R,e dnutcher,e aCdOsL(LnNtEhTr_eDaIdRsE)C,T ,t iSdIIMnPBLlEo,c kM(atxh,r efaldoIadtx). x )| ,^ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(391g:r95o:u pnote: )expanded from macro 'IMPL_COLL_FUNC', | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:B15c:a swarning: tinitializer order does not match the declaration order [-Wreorder-ctor], &direct- >562o | u t , dtiirde(ctti-d>)d,o wnnt,h raeragdss-(>nstehnrdebaudfsf),, atrigdsI-n>Brleoccvkb(utfhfr,e a d| I ^d x.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:( gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up), 202 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) Run W563o | r k E l esmteenptSe(s)[.NrCuCnL(_wPeR)O;T O _| S ^I MPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp/:N11C:C1L:_ Snote: Tin instantiation of member function 'RunWork, 2, 2>::run' requested hereE PS/s i11z | eIoMfP(LT_)C)O L{L _ F| U ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N C (| A group(groupl lReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :C641O:L11L:N Enote: Tin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ DIRECT ,641 | S I M P L E , M a xp,r ifmlso(atti)d - t| i^d Star/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:R391e:d95u:c enote: ,expanded from macro 'IMPL_COLL_FUNC' nThrea d391s | R e dRuucneW,o rdkiudnocw#n#,f u&ndci,r etcytp-e>,o uFtu,n ca#r#gdse-v>rseednodpba,r gNsC-C>Lr_eAcLvGbOu_f#f#,a l g| o ^, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:P202R:O53T:O _note: #in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here# prot o202> | ( ) . r u n ( & nRcucnlWSohrmkeEml.ewmoernkt)<;F n\, T| , ^ RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:,562 :A15l:g onote: ,field 'nthreads' will be initialized after field 'tidInBlock' Proto >562( | ) . r u nt(iwde()t;i d )| , ^ nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppd:s9(:n1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea ds), 9t | iIdMIPnLB_lCoOcLkL(_tFhUrNeCa(dAIldlxR.exd)u,c eg,r oCuOpL(LgNrEoTu_pD)I,R E C| T ^~~~~~~~~~~~~~~~~, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L60E:, note: Mfield 'group' will be initialized after field 'stepSize'a x, ui n562t | 6 4 _ t )t i d| (^t id)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391n:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd s(nthr e391a | d s )R,u ntWiodrIkn, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolcSkh(mtehmr.ewaodrIkd)x;. x\) , | g ^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)15,: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d ( tsitde)p,S inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBelso[cNkC(CtLh_rPeRaOdTIOd_xS.IxM)P,L Eg]r/oNuCpC(Lg_rSoTuEpP)S,/ s i| z ^~~~~~~~~~~~~~~~~e of(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 60{: note: | field 'group' will be initialized after field 'stepSize' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i626d:)9,: nnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads (626n | t h r e a d s ) ,p rtiimdsI(ntBildo-ctki(dtShtraeratdSIcdaxt.txe)r,, gnrTohurpe(agdrsoSucpa)t,t e r| , ^~~~~~~~~~~ NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hLNET:_562D:I15R:E Cwarning: Tinitializer order does not match the declaration order [-Wreorder-ctor], SIMPLE, Max ,562 | d o u b ltei)d ( t| id),^ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95(:n tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eads), 391t | i d IRnuBnlWoocrkk( | , N C CsLt_eApLSGiOz_e#(#naclcgloS,h mNeCmC.Lc_oPmRmO.TbOu_f#f#Spirzoetso[>N(C)C.Lr_uPnR(O&TnOc_cSlISMhPmLeEm]./wNoCrCkL)_;S T\E P S| / ^s izeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: note: | field 'nthreads' will be initialized after field 'tidInBlock' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i626d:)9,: nnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads( n626t | h r e a d s ) , ptriidmIsn(Btliodc-kt(itdhSrteaardtISdcxa.txt)e,r ,g rnoTuhpr(egardosuSpc)a,t t e| r ^~~~~~~~~~~~~~~~~, NUL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:,562 :d60i:r enote: cfield 'group' will be initialized after field 'stepSize't ->up, 562a | r g s - >tsiedn(dtbiudf)f,, natrhgrse-a>drse(cnvtbhurfefa,d s )| , ^ tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k202(:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered Idx. x202) | , g r o u p ( gRruonuWpo)r,k E l| e ^~~~~~~~~~~m ent().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-15>:r ewarning: cinitializer order does not match the declaration order [-Wreorder-ctor]v buff, | ^562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t202i:d53):, note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202s | ( n t h r e a d sR)u,n WtoirdkIEnlBelmoecnkt(,( ) .| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n (| w tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ); | 563 ^ | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppt:e9p:S1i:z enote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heren cclS h9m | eImM.PcLo_mCmO.LbLu_fFfUSNiCz(eAsl[lNRCeCdLu_cPeR,O TCOO_LSLINMEPTL_ED]I/RNECCCTL,_ SSTIEMPPSL/Es,i zMeaoxf,( Tu)i)n t{6 4 _| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) | | group(group^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::95666:: 9note: :expanded from macro 'IMPL_COLL_FUNC' note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | 666 | R u n W o r k ,ou ppn<,tt hyNrpUeeLa>Ld,,s (NanCrtCg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n thre a563d | s ( n t hsrteeapdSsi)z,e (tnicdcIlnSBhlmoecmk.(ctohmrme.abduIfdfxS.ixz)e,s [gNrCoCuLp_(PgRrOoTuOp_)S,I M P| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E ] /| N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_ST E563P | S / s i zsetoefp(STi)z)e ({n c c| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S h m| e group(groupm .comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h[:N677C:C11L:_ Pnote: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO TO_SIM P677L | E ] / N C C L _ S T EpPrSi/mssi(zteiodf-(tTi)d)S t{a r t| B ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c a s| t group(group, nThreadsB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:a641s:t11,: ¬e: din instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei rect- >641o | u t , d i r e c t -p>rdiomwsn(,t iadr-gtsi-d>SsteanrdtbRuefdfu,c ea,r gnsT-h>rreeacdvsbRuefdfu,c e ,| ^d irect->dow/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:,202 :&53d:i rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec t->o u202t | , a r g s - > sReunndWbourfkfE,l eamregnst-<>Frne,c vTb,u fRfe,d O p| , ^ Algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:r202o:t53o:> (note: ).runin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( we); 202 | | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppn:W9o:r1k:E lnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herem ent< F9n | I,M PTL,_ CROeLdLO_pF,U NACl(gAol,l RPerdoutcoe>,( )C.OrLuLnN(EwTe_)D;I R E| C ^T , SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppL:E9,: 1M:a xnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here uin t96 | 4I_MtP)L _ C| O^L L_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:A391l:l95R:e dnote: uexpanded from macro 'IMPL_COLL_FUNC'c e, COLL N391E | T _ DRIuRnEWCoTr,k , N C391C | L _ ARLuGnOW_o#r#kau(n)c.#r#udne(v&rnecdcolpSw,o rNkC)C;L _\A L G| O ^_ ##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C15C:L _note: Pfield 'nthreads' will be initialized after field 'tidInBlock'R OTO_ #562# | p r o t ot>i(d)(.triudn)(,& nnctchlrSehamdesm(.nwtohrrke)a;d s\) , | t ^i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a dIdx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~e ads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's ), t i562d | I n B l otcikd((tthirde)a,d Indtxh.rxe)a,d sg(rnotuhpr(egardosu)p,) ,t i d| I ^~~~~~~~~~~~~~~~~n Blo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd Idx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~r eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hWork:<562n:c15c:l Fwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]n c##func, typ e562, | F u n ct#i#dd(etvirde)d,o pnd,s (NnCtChLr_eAaLdGsO)_,# #taildgIon,B lNoCcCkL(_tPhRrOeTaOd_I#d#xp.rxo)t,o >g(r)o.urpu(ng(r&onucpc)l,S h m| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m . w| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r k); \563 | | ^ stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562(:n15c:c lnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'h mem.c o562m | m . b u ftfiSdi(zteisd[)N,C CnLt_hPrReOaTdOs_(SnItMhPrLeEa]d/sN)C,C Lt_iSdTIEnPBSl/oscikz(etohfr(eTa)d)I d{x . x| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, g| r group(groupo up(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h11::562 :note: 60in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: field 'group' will be initialized after field 'stepSize' 641 | 562 | t i dp(rtiimds)(,t indt-htriedaSdtsa(rnttRherdeuacdes,) ,n TthirdeIandBslRoecdku(cteh,r edaidrIedcxt.-x>)d,o wgnr,o u&pd(igrreocutp-)>,o u t| , ^~~~~~~~~~~ args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:-562t:i15d:S twarning: ainitializer order does not match the declaration order [-Wreorder-ctor]r tReduce, 562n | T h r e atdisdR(etdiudc)e,, nntuhlrlepatdrs(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIREC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT, SI:M562P:L15E:, warning: Minitializer order does not match the declaration order [-Wreorder-ctor]a x, half) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nt h391r | e a dRsu(nnWtohrrke),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ A| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)G O_## a563l | g o , NsCtCeLp_SPiRzOeT(On_c#c#lpSrhomteom>.(c)o.mrmu.nb(u&fnfcScilzSehsm[eNmC.CwLo_rPkR)O;T O\_ S I| M ^P LE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:S Tnote: Efield 'nthreads' will be initialized after field 'tidInBlock'P S/siz e562o | f ( T ) )t i{d ( t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ) ,| group(groupn threads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s687):,11 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI nBlock (687t | h r e a d I d x . x )p,r igmrso(utpi(dg-rtoiudpS)t,a r t| B ^~~~~~~~~~~~~~~~~c as/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:,562 :n60T:h rnote: efield 'group' will be initialized after field 'stepSize'a dsBc a562s | t , & dtiirde(ctti-d>)o,u tn,t hnruelaldpst(rn,t harregasd-s>)s,e ntdibduIfnfB,l oacrkg(st-h>rreeacdvIbduxf.fx,) , | g ^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:-562>:d15o:w nwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] args->se n562d | b u f f ,t iadr(gtsi-d>)r,e cnvtbhurfefa,d s (| n ^t hreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53I:n Bnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo ck(t h202r | e a d I d x . x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T , | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e dOp, 563A | l g o , sPtreoptSoi>z(e)(.nrcucnl(Swhem)e;m . c| o ^m m.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppi:z10e:s1[:N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _PRO T10O | _ISMIPMLP_LCEO]L/LN_CFCULN_CS(TAElPlSR/esdiuzceeo,f (CTO)L)L N{E T _| D ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I R E| C group(groupT , SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641M:a11x:, note: hin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea lf) | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: pexpanded from macro 'IMPL_COLL_FUNC'r ims(ti d391- | t i dRSutnaWrotrRkecd#o#wdne,v r&eddiorpeeo>u,t ,N CaCrLg_sA-L>GsOe_n#d#baulfgfo,, aNrCgCsL-_>PrReOcTvOb_u#f#fp,r o t| o ^> ().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:n202c:c53l:S hnote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree m.wo r202k | ) ; \ | ^ RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:l562e:m15e:n tnote: r(e)a.drsu(nn(twher)e;a d s| ) ^, tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppB:l12o:c1k:( tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer ead I12d | xI.MxP)L,_ CgOrLoLu_pF(UgNrCo(uApl)l,R e d| u ^~~~~~~~~~~~~~~~~c e, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:O562L:L60N:E Tnote: _field 'group' will be initialized after field 'stepSize'D IRECT ,562 | S I M P LtEi,d (Mtaixd,) ,d onutbhlree)a d s| (^n thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391s:)95,: tnote: iexpanded from macro 'IMPL_COLL_FUNC'd InBloc k391( | t h rReuandWIodrxk., NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(gro u562 | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), nthrea d563s | ( n t h rsetaedpsS)i,z et(indcIcnlBSlhomcekm(.tchormema.dbIudfxf.Sxi)z,e sg[rNoCuCpL(_gPrRoOuTpO)_,S I M| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L E ]| / tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N CCL_ S563T | E P S / ssitzeepoSfi(zTe)()n c{c l S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| . group(groupc omm.buffSizes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:L677_:P11R:O Tnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ SIMPL E677] | / N C C L _ S T E P Sp/rsiimzse(otfi(dT-)t)i d{S t a| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t B c| a group(groups t, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d677s:B11c:a snote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, &dire c677t | - > o u t , d i r epcrti-m>sd(otwind,- tairdgSst-a>rsteBncdabsutf,f ,n Tahrrgesa-d>srBeccavsbtu,f f&,d i r| e ^c t->out, d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:r202e:c53t:- >note: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo wn, a202r | g s - > s e n d bRuufnfW,o rakrEglse-m>ernetc, 2, 2>::run' requested here> ().r u202n | ( w e ) ; | ^R unWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppl:e11m:e1n:t , 2, 2>::run' requested heren , T, 11R | eIdMOPpL,_ CAOlLgLo_,F UPNrCo(tAol>l(R)e.druucne(,w eC)O;L L N| E ^T _DIRECT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp,: 12S:I1M:P Lnote: Ein instantiation of member function 'RunWork, 2, 2>::run' requested here, Max ,12 | fIlMoPaLt_)C O L| L^_ FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:A391l:l95R:e dnote: uexpanded from macro 'IMPL_COLL_FUNC'c e, COL LN391E | T _ DRIuRnEWCoTr,k ,391 | N C CRLu_nAWLoGrOk_<#n#cacllgFou,n cN#C#CfLu_nPcR,O TtOy_p#e#,p rFoutnoc>#(#)d.ervurne(d&onpce,m .NwCoCrLk_)A;L G\O _ #| # ^a lgo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_##p r562o | t o > ( )t.irdu(nt(i&dn)c,c lnSthhmreema.dwso(rnkt)h;r e\a d s| ) ^, tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (note: tfield 'nthreads' will be initialized after field 'tidInBlock'h readI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s (nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60s:) ,note: field 'group' will be initialized after field 'stepSize't idInBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~~~~~~~, ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B60l:o cnote: kfield 'group' will be initialized after field 'stepSize'( threa d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~n threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkt,i dN(CtCiLd_)A,L GnOt_h#r#eaaldgso(,n tNhCrCeLa_dPsR)O,T Ot_i#d#IpnrBoltooc>k(()t.hrruena(d&Indcxc.lxS)h,m egmr.owuopr(kg)r;o u\p ) ,| ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 563note: | field 'nthreads' will be initialized after field 'tidInBlock' st e562p | S i z e (tnicdc(ltSihdm)e,m .nctohmrme.abdusf(fnStihzreesa[dNsC)C,L _tPiRdOITnOB_lSoIcMkP(LtEh]r/eNaCdCILd_xS.TxE)P,S /gsriozuepo(fg(rTo)u)p ){, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655 :t11i:d (note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d), nth r655e | a d s ( n t h r e a dpsr)i,m st(itdiIdn-BtliodcSkt(atrhtrReeadduIcdex,. xn)T,h rgeraoduspR(egdruocuep,) ,n u l| l ^~~~~~~~~~~p tr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_15S:I Mwarning: Pinitializer order does not match the declaration order [-Wreorder-ctor]L E]/NCCL_ S562T | E P S / stiizde(otfi(dT)),) n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ( group(groupn threads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I677n:B11l:o cnote: kin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( threa d677I | d x . x ) , g r o uppr(igmrso(utpi)d,- t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S t a| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t Bcast ,563 | n T h r esatdespBSciazset(,n c&cdliSrhemcetm-.>cooumtm,. bduifrfeScitz-e>sd[oNwCnC,L _aPrRgOsT-O>_sSeInMdPbLuEf]f/,N CaCrLg_sS-T>ErPeSc/vsbiuzfefo,f ( T| ) ^) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 group(group: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 677202: | 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Ru n677W | o r k E l e m e n t T(h)r.eraudns(Bwcea)s;t , | & ^d irect-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp>:o12u:t1,: dnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested herer ect- >12d | oIwMnP,L _aCrOgLsL-_>FsUeNnCd(bAulflfR,e daurcges,- >CrOeLcLvNbEuTf_fD,I R E| C ^T , SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:,202 :M53a:x ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered ouble) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 12warning: | initializer order does not match the declaration order [-Wreorder-ctor]I MPL_COLL _562F | U N C ( AtlildR(etdiudc)e,, nCtOhLrLeNaEdTs_(DnItRhErCeTa,d sS)I,M PtLiEd,I nMBalxo,c kd(otuhbrleea)d I d| x^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:(95g:r onote: uexpanded from macro 'IMPL_COLL_FUNC'p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 391 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RunW o563r | k < n c csltFeupnSci#z#ef(unnccc,l Sthympeem,. cFoumnmc.#b#udfefvSriezdeosp[, NCCL_ALGO_##algo,C LN_CPCRLO_TPOR_OSTIOMPL_##Ep]r/oNtCoC>L(_)S.TrEuPnS(/&sniczceloSfh(mTe)m). w{o r k| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~; \| group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562626::159:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562626 | | t i d ( tpirdi)m,s (nttihdr-etaiddsS(tnatrhtrSecaadtst)e,r ,t indTIhnrBelaodcskS(ctahtrteeard,I dNxU.LxL),, dgirroeucpt(-g>ruopu,p )a,r g s| - ^~~~~~~~~~~~~~~~~> send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f60,: anote: rfield 'group' will be initialized after field 'stepSize'g s->r e562c | v b u f ft,i d (| t ^i d), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202s | ) , t i d I n BRluoncWko(rtkhErleeamdeIndtx<.Fxn),, Tg,r oRuepd(Ogpr,o uApl)g,o , | P ^~~~~~~~~~~r oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:h562m:e15m:. wwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]r k); \ | ^562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )note: ,field 'nthreads' will be initialized after field 'tidInBlock' nthrea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x . x| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), gro u563p | ( g r o uspt)e,p S i| z ^~~~~~~~~~~~~~~~~e (nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:l562S:h60m:e mnote: .field 'group' will be initialized after field 'stepSize'c omm.b u562f | f S i z etsi[dN(CtCiLd_)P,R OnTtOh_rSeIaMdPsL(En]t/hNrCeCaLd_sS)T,E PtSi/dsIinzBeloofc(kT()t)h r{e a d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d x .| x group(group) , group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o666u:p9):, note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:S15t:a rwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]R educe, n T562h | r e a d stRiedd(utcied,) ,d inrtehcrte-a>ddso(wnnt,h r&edaidrse)c,t -t>ioduItn,B laorcgks(-t>hsreenaddbIudfxf.,x )a,r ggsr-o>urpe(cgvrbouufpf),, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202563: | 53 : note: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret epSi z202e | ( n c c l S h m eRmu.ncWoomrmk.EbluefmfeSnitz_(S)T.ErPuSn/(swiez)e;o f (| T ^) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :| 13 group(group: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 66613: | 9I:M Pnote: Lin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ COLL_ F666U | N C ( A l l R e dpurciem,s (CtOiLdL,N EnTT_hDrIeRaEdCsTG,a tShIeMrP,L Ed,i rMeacxt,- >rucpc,l _NbUfLlLo,a ta1r6g)s - >| s^e ndbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,391 :a95r:g snote: -expanded from macro 'IMPL_COLL_FUNC'> recvbuf f391, | | R ^u nWork, 2, 2>::run' requested here# func ,202 | t y p e , F u nRcu#n#WdoervkrEeldeompe,, TN,C CRLe_dAOLpG,O _A#l#gaol,g oP,r oNtCoC>L(_)P.RrOuTnO(_w#e#)p;r o t| o ^> ().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppn:(10&:n1c:c lnote: Sin instantiation of member function 'RunWork, 2, 2>::run' requested hereh mem .10w | oIrMkP)L;_ C\O L L| _ ^F UNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c15e:, note: COLLNET_DIRECT, SIMPLE, Max, half) | ^ field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(ti d391) | , nRtuhnrWeoardks<(nnctchlrFeuandcs#)#,f utnicd,I ntBylpoec,k (Ftuhnrce#a#ddIedvxr.exd)o,p (,g rNoCuCpL)_,A L G| O ^~~~~~~~~~~~~~~~~_ ##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:,562 :N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'P ROTO_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (note: tfield 'nthreads' will be initialized after field 'tidInBlock'h readI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:: 562note: :field 'group' will be initialized after field 'stepSize'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562I:d15x:. xwarning: )initializer order does not match the declaration order [-Wreorder-ctor], group( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~t id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork | , N C CtLi_dA(LtGiOd_)#,# anltghor,e aNdCsC(Ln_tPhRrOeTaOd_s#)#,p rtoitdoI>n(B)l.orcukn((t&hnrcecaldSIhdmxe.mx.)w,o rgkr)o;u p\( g r| o ^u p), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 15 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: field 'nthreads' will be initialized after field 'tidInBlock' 563562 | | sttiedp(Stiizde)(,n cnctlhSrhemaedms.(cnotmhmr.ebaudfsf)S,i zteisd[INnCBClLo_cPkR(OtThOr_eSaIdMIPdLxE.]x/)N,C CgLr_oSuTpE(PgSr/osuipz)e,o f (| T ^~~~~~~~~~~~~~~~~) ) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'group' will be initialized after field 'stepSize' group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i641d:(11t:i dnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nthre a641d | s ( n t h r e a d s )p,r itmisd(ItniBdl-otcikd(SttharretaRdeIdduxc.ex,) ,n Tghrroeuapd(sgRreoduupc)e,, d| i ^~~~~~~~~~~r ect->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ float16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(mAllRedu.cceo,m mC.ObLuLfNfESTi_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hzD:eI562sR:[E15NC:CT C,warning: L initializer order does not match the declaration order [-Wreorder-ctor]_S PIRMOPTLOE _,562S | IM Ma Px L, E t]fi/ldNo(CatCtiL)d_ )S ,T| E^nP tSh/r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hse:ia391zd:es95o(:fn (tnote: Thexpanded from macro 'IMPL_COLL_FUNC')r )e a{d s 391)| | , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t Ri| ud group(groupnI WnoBrlko, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereFg urnocu#p#)d,e v r| 677e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | d o p| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) t y p e563 > | , pN rC iCsmLts_e(AptLSiGidOz-_et#(i#ndacSlctglaoSr,ht mBNecCmaC.sLct_o,Pm RmnO.TTbhOur_fe#fa#SdpisrzBoectsao[s>Nt(C,)C .L&r_duPinRr(Oe&TcnOtc_-cS>lIoSMuhPtmL,eE m]d./iwNroCerCckLt)_-;S> Td\Eo Pw Sn| /, ^s iazregosf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-(:>T562s):e)15n :d{ b note: u field 'nthreads' will be initialized after field 'tidInBlock'f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a562| r | group(groupg s - > rteicdv(btui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hfd:f)655,,: 11 n:| t ^hnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads(nt h655r | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he :a 202d :s 53) :, note: t in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d IpnrBilm os202c( | kt (i td h- rt ei ad dS ItRdauxrn.tWxRo)er,dk uEgclreeo,mu epnn(Ttgh c( t) -.t>rioudun(t(t,wi eda))r;,g s n-| t> ^hs reenaddbsu(fnf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppt,:h 11ra:er1ag:ds s-note: )>in instantiation of member function 'RunWork, 2, 2>::run' requested here,r etcivdb Iu11nf | BfIl,Mo Pc Lk| _( ^Ct OhLrLe_a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hFd:UI202Nd:Cx53(.:Ax l)note: l,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR egdr uo202cu | ep ,( g Cr Oo Lu Lp N) E,RT u_ nD| WI ^~~~~~~~~~~oR rEkCETl,e mSeInMtP:(391):.95r:u nnote: (expanded from macro 'IMPL_COLL_FUNC'w e); | ^391 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppk:<12n:c1c:l Fnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren c##f u12n | cI,M PtLy_pCeO,L LF_uFnUcN#C#(dAelvlrReeddoupcL,L NNECTC_LD_IARLEGCOT_,# #SaIlMgPoL,E ,N CMCaLx_,P RdOoTuOb_l#e#)p r o| t^o >().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n391(:&95n:c cnote: lexpanded from macro 'IMPL_COLL_FUNC'S hmem.w o391r | k ) ;R u\n W o| r ^k d,s (NnCtChLr_eAaLdGsO)_,# #taildgIon,B lNoCcCkL(_tPhRrOeTaOd_I#d#xp.rxo)t,o >g(r)o.urpu(ng(r&onucpc)l,S h m| e ^~~~~~~~~~~~~~~~~m .wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;60 :\ note: field 'group' will be initialized after field 'stepSize'| ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd ), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~) , group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:ec562v:b15u:f fwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered (tid )202, | n t h r e a d sR(unntWhorrekaEdlse)m,e nttiu(p)(.grruonu(pw)e,) ; | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp: 13563: | 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested heres tep S13i | zIeM(PnLc_cClOSLhLm_eFmU.NcCo(mAml.lbRuefdfuSciez,e sC[ONLCLCNLE_TP_RDOITROE_CSTI,M PSLIEM]P/LNEC,C LM_aSxT,E PrSc/csli_zbefolfo(aTt)1)6 ){ | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :641:11: 391note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWo r641k | < n c c l F u n c # #pfruinmcs,( ttiydp-et,i dFSutnacr#t#Rdeedvurceed,o pnd,s RNeCdCuLc_eA,L GdOi_r#e#catl-g>od,o wNnC,C L&_dPiRrOeTcOt_-#>#opurto,t oa>r(g)s.-r>usne(n&dnbcucflfS,h maermg.sw-o>rrke)c;v b\u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :field 'nthreads' will be initialized after field 'tidInBlock'53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i dR)u,n WnotrhkrEelaedmse(nnttI(d)x..rxu)n,( wger)o;u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp| : ^~~~~~~~~~~~~~~~~13 :1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'RunWork, 2, 2>::run' requested here562 :60: note: 13field 'group' will be initialized after field 'stepSize' | IMPL_ C562O | L L _ F UtNiCd((AtlildR)e,d uncteh,r eCaOdLsL(NnEtTh_rDeIaRdEsC)T,, tSiIdMIPnLBEl,o cMka(xt,h rrecacdlI_dbxf.lxo)a,t 1g6r)o u p| (^g roup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C15C:L _warning: Ainitializer order does not match the declaration order [-Wreorder-ctor]L GO_##algo, NC C562L | _ P R O TtOi_d#(#tpirdo)t,o >n(t)h.rreuand(s&(nnctchlrSehmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: warning: field 'nthreads' will be initialized after field 'tidInBlock'initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 60 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: field 'group' will be initialized after field 'stepSize' 563 | 562 | s t e ptSiidz(et(indc)c,l Snhtmherme.acdosm(mn.tbhurfefaSdisz)e,s [tNiCdCILn_BPlRoOcTkO(_tShIrMePaLdEI]d/xN.CxC)L,_ SgTrEoPuSp/(sgirzoeuopf)(,T ) )| ^~~~~~~~~~~{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp(:)12.:r1u:n (note: win instantiation of member function 'RunWork, 2, 2>::run' requested heree ); | ^12 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppL:_13C:O1L:L _note: Fin instantiation of member function 'RunWork, 2, 2>::run' requested hereU NC(A l13l | RIeMdPuLc_eC,O LCLO_LFLUNNECT(_ADlIlRREeCdTu,c eS,I MCPOLLEL,N EMTa_xD,I RdEoCuTb,l eS)I M P| L^E , Max/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391r:c95c:l _note: bexpanded from macro 'IMPL_COLL_FUNC'f loat16 )391 | | ^R unWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k391<:n95c:c lnote: Fexpanded from macro 'IMPL_COLL_FUNC'u nc##fu n391c | , tRyupneW,o rFkuy,p eN,C CFLu_nAcL#G#Od_e#v#raeldgoop,< tNyCpCeL>_,P RNOCTCOL__#A#LpGrOo_t#o#>a(l)g.or,u nN(C&CnLc_cPlRSOhTmOe_m#.#wporrokt)o;> (\) . r| u ^n (&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m15.:w onote: rfield 'nthreads' will be initialized after field 'tidInBlock'k ); \ | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )note: ,field 'nthreads' will be initialized after field 'tidInBlock' nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~. x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u60p:( gnote: rfield 'group' will be initialized after field 'stepSize'o up), 562| | ^~~~~~~~~~~~~~~~~ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t60i:d )note: ,field 'group' will be initialized after field 'stepSize' nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~. x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hadIdx:.562x:)15,: gwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o up(group), | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 tid(:t60i:d )note: ,field 'group' will be initialized after field 'stepSize' nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. x )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) group (563g | r o u p )s,t e p| S ^~~~~~~~~~~i ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o up(group), 562| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(Stihzreesa[dNICdCxL._xP)R,O TgOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T E P| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/ sizeo f563( | T ) ) {s t e| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S i z| e group(group( ncclShmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f916f:S7i:z enote: sin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here[ NCCL _916P | R O T O _ S IpMrPiLmEs](/gNrCoCuLp_TSiTdE,P Sg/rsoiuzpeNotfh(rTe)a)d s{, &| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e c v| , group(group &send, args->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:n916d:b7u:f fnote: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here args-> r916e | c v b u f f ,p r i| m ^s (groupT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202,: 53g:r onote: uin instantiation of member function 'RunWorkElement, 3, 2>::run' requested herep Nthr e202a | d s , & r e c vR,u n&WsoernkdE,l eamregnst-<>Fsne,n dTb,u fRfe,d Oapr,g sA-l>groe,c vPbruoftfo,> ( )| . ^r un(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^: 53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7: 1202: | note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7R | uInMWPoLr_kCEOlLeLm_eFnUtN,( )S.IrMuPnL(Ew,e )P;r e M| u ^l Sum, ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cppn:t53:21_:t )note: in instantiation of member function 'RunWork, 3, 2>::run' requested here | ^ 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:M391P:L95_:C Onote: Lexpanded from macro 'IMPL_COLL_FUNC'L _FUNC( A391l | l R eRduuncWeo,r kC ,| ^N CCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hG:O391_:#95#:a lnote: gexpanded from macro 'IMPL_COLL_FUNC'o , NCCL_ P391R | O T OR_u#n#Wporroktc(c)l.Fruunnc(#&#nfcucnlcS,h mteymp.ew,o rFku)n;c #\# d e| v ^r edop:,15 :N Cnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'L _ALGO _562# | # a l g ot,i dN(CtCiLd_)P,R OnTtOh_r#e#apdrso(tnot>h(r)e.ardusn)(,& ntcicdlISnhBmleomc.kw(otrhkr)e;a d\I d x| . ^x ), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock') , | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g rou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hId:x562.:x15):, warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup(group) ,562 | | ^~~~~~~~~~~~~~~~~ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i60d:) ,note: field 'group' will be initialized after field 'stepSize'n thread s562( | n t h r etaidds()t,i dt)i,dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ ##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ P| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O TO_##p r563o | t o > ( )s.treupnS(i&znec(cnlcSchlmSehmm.ewmo.rcko)m;m .\b | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreadesd(OnptAhrrge,a d0s,) ,a rtgisd-I>ncBolnoncIkn(dtehxr,e aadrIgdsx-.>xc)o,n ngIrnoduepx()g;r o u| p ^) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :s80t:e5p:S inote: zin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heree (ncclS h80m | e m . c ormumn.Rbiunfgf_(SaIrMgPsL)E;] / N| C ^C L_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:o53f:( Tnote: )in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~202 | | group(group RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hE:l34e:m7e:n tnote: , FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF n, T, Re d34O | p , A l g op,r iPmrso(ttoi>d(,) .nrtuhnr(ewaed)s;, &| r ^i ng->pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cppe:v11,: 1&:r inote: nin instantiation of member function 'RunWork, 1, 2>::run' requested hereg ->ne x11t | ,I MaPrLg_sC-O>LsLe_nFdUbNuCf(fR,e daurcges,- >RrIeNcGv,b uSfIfM,P LaEr,g sP-r>erMeudlOSpuAmr,g ,f l0o,a ta)r g s| -^> connIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:e391x:,95 :a rnote: gexpanded from macro 'IMPL_COLL_FUNC's ->conn I391n | d e xR)u;n W o| r ^k , ProtoSimple<1, 1>>' requested herec , ty p80e | , F u nrcu#n#Rdienvgr ,P rNoCtCoL>_(AaLrGgOs_)#;# a l| g ^o , NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:P202R:O53T:O _note: #in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here# proto >202( | ) . r u n ( & n cRculnSWhomrekmE.lweomrekn)t;< F\n , | T ^, RedOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :A562l:g15o:, note: Pfield 'nthreads' will be initialized after field 'tidInBlock'r oto>() .562r | u n ( w et)i;d ( t| i ^d ), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cppe:a9d:s1(:n tnote: hin instantiation of member function 'RunWork, 1, 2>::run' requested herer eads )9, | ItMiPdLI_nCBOlLoLc_kF(UtNhCr(eRaeddIudcxe.,x )R,I NgGr,o uSpI(MgPrLoEu,p )P,r e M| u ^~~~~~~~~~~~~~~~~l Sum,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n60t:6 4note: _field 'group' will be initialized after field 'stepSize't ) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nth r391e | a d sR(unntWhorreka,, N| C ^~~~~~~~~~~C L_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562T:E15P:S /warning: sinitializer order does not match the declaration order [-Wreorder-ctor]i zeof(T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:e34a:d7s:( nnote: tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads), 34t | i d I n B l opcrki(mtsh(rteiadd,I dnxt.hxr)e,a dgsr,o u&pr(ignrgo-u>pp)r,e v ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~& r i| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g ->nex t563, | a r g ss-t>espeSnidzbeu(fnfc,c laSrhgmse-m>.rceocmvmb.ubfuff,f Sairzgess-[>NrCeCdLO_pPARrOgT,O _0S,I MaPrLgEs]-/>connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBNCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562n:g15-:> pwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e v, &ring->ne x562t | , a r gtsi-d>(steindd)b,u fnft,h raeragdss-(>nrtehcrveads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr =/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 77 warnings generated when compiling for gfx908. warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ MPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h):,514 :g9r:o uwarning: pvariable 'offset' set but not used [-Wunused-but-set-variable]( gro u514p | ) , | i ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~n t | o warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3f fset 510= | t i d ;s t e| p ^S ize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp :| 1 group(group: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h::2933:: 7warning: :field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 507 | 33 | t i d ( tpirdi)m,s (nttihdr,e andtsh(rnthreaedasd)s,, w&irdi(ntgi-d>%pWrAeRvP,_ S&IrZiEn)g,- >wnaerxpt(,t iadr/gWsA-R>Ps_eSnIdZbEu)f,f , | a ~~~~~~~~~~~~~~~~~~r g s| - stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)> recv b508u | f f , awragrsp-I>nrBeldoOcpkA(rtgh,r e0a,d Iadrxg.sx-/>WcAoRnPn_ISnIdZeEx),, a r| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s - >| c warp(tid/WARP_SIZEo nnIn d509e | x ) ; f| l ^a gThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ha:d92(:(5t:i dnote: %in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here4 )== 392) | , g r oruupn(Rgirnogu ( asrtgesp)S;i z e| ( ^n cclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:.202c:o53m:m .note: bin instantiation of member function 'RunWorkElement, 1, 1>::run' requested hereu ffSi z202e | s [ N C C L _ P RROuTnOW_oLrLk1E2l8e]m/eNnCtC{( ) .| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u n (| w group(groupe ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp :note: 8in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here: 1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 33 | 8 | I M P L _pCrOiLmLs_(FtUiNdC,( RnetdhurceeaSdcsa,t t&erri,n gR-I>NpGr,e vL,L 1&2r8i,n gP-r>onde,x tu,i natr3g2s_-t>)s e n| d^b uff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a391r:g95s:- >note: rexpanded from macro 'IMPL_COLL_FUNC'e cvbuff ,391 | a r gRsu-n>WroerdkO,c otnynpIen,d eFxu,n ca#r#gdse-v>rceodnonpI;, N| C ^C L_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hO:_92#:#5a:l gnote: oin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here, NCC L92_ | P R O T Or_u#n#Rpirnogt,( )R.erduOnp(,& nPcrcoltSohLmLe1m2.8w>o(rakr)g;s )\; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, daIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ ta2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 7 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), In file included from gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppo:u1p: (In file included from g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o10u: pIn file included from )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h,: 167 : | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~: 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppp:)1,: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~10 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::167562: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 15field 'group' will be initialized after field 'stepSize': warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:u562n:W15o:r kwarning: Einitializer order does not match the declaration order [-Wreorder-ctor]l ements(()n.trhurne(awdes));, t| i ^d InBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpph:r4e:a1d:I dnote: xin instantiation of member function 'RunWork, 2, 2>::run' requested here. x), gr o4u | pI(MgPrLo_uCpO)L,L _ F| U ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C (| A tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l lRed u563c | e , C sOtLeLpNSEiTz_eD(InRcEcClTS,h SImMePmL.Ec,o mPmr.ebMuuflfSSuimz,e si[nNtC8C_Lt_)P R O| T^O _SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h]:/391N:C95C:L _note: Sexpanded from macro 'IMPL_COLL_FUNC'T EPS/size o391f | ( T )R)u n{W o r| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~< n c| c group(groupl Func##fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:,655 :t11y:p enote: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Func# #655d | e v r e d o p < t y pper>i,m sN(CtCiLd_-AtLiGdOS_t#a#ratlRgeod,u cNeC,C Ln_TPhRrOeTaOd_s#R#epdruocteo,> (n)u.lrlupnt(r&,n c&cdliSrhemcetm-.>woourtk,) ;a r\g s -| > ^s endbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r15g:s -note: >field 'nthreads' will be initialized after field 'tidInBlock'r ecvbuff ,562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202):,53 :n tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eads (202n | t h r e a d s ) ,R utniWdoIrnkBElloecmke(ntth (| ) ^~~~~~~~~~~~~~~~~. run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:w562e:)60;: note: | field 'group' will be initialized after field 'stepSize' ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp | : 5 : 1 :t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here( tid )5, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. xS)I,M PgLrEo,u pP(rgerMouulpS)u,m , | u ^~~~~~~~~~~i nt8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLEIn file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :P1r: eIn file included from M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:l10S: uIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h,: 167u: i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t5628:_15t:) warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :562 | note: expanded from macro 'IMPL_COLL_FUNC' tid( t391i | d ) ,R unntWhorrekau,p (NgCrCoLu_pA)L,G O _| # ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~# a l| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o , NCC L563_ | P R O T Os_t#e#ppSriozteo(>n(c)c.lrSuhnm(e&mn.cccolmSmh.mbeumf.fwSoirzke)s;[ N\C C L| _ ^P ROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ]note: /field 'nthreads' will be initialized after field 'tidInBlock'N CCL_S T562E | P S / s itziedo(ft(iTd))), {n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g roup( g563r | o u p ) ,s t e| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S i z| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( ncclS h563m | e m . c osmtme.pbSuifzfeS(inzcecsl[SNhCmCeLm_.PcRoOmTmO._bSuIfMfPSLiEz]e/sN[CNCCLC_LS_TPERPOST/Os_iSzIeMoPfL(ET])/)N C{C L _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T E P| S group(group/ sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h): 626{: 9 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | group(group 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 666 : 9p:r inote: min instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (tid- t666i | d S t a r t S c aptrtiemrs,( tniTdh,r enaTdhsrSecaadtstGeart,h eNrU,L Ld,i rdeicrte-c>tu-p>,u pN,U LaLr,g sa-r>gsse-n>dsbeunfdfb,u fafr,g sa-r>grse-c>vrbeucfvfb,u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::53202:: 53note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R u nRWuonrWkoErlkeEmleenmtet(o)>.(r)u.nr(uwne()w;e ) ;| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp::15:: 1note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | I5M | PILM_PCLO_LCLO_LFLU_NFCU(NACl(lARleldRuecdeu,c eC,O LCLONLELTN_EDTI_RDEICRTE,C TS,I MSPILMEP,L EP,r ePMruelMSuulmS,u mu,i nuti8n_tt8)_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::95391:: 95note: :expanded from macro 'IMPL_COLL_FUNC' note: expanded from macro 'IMPL_COLL_FUNC' 391 | 391R | u n WRournkWy,p eN>C,C LN_CACLLG_| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AOLGO__####aallggoo,, NNCCCCLL__PPRROOTTOO__####pprroottoo>>(())..rruunn((&&nnccccllSShhmmeemm..wwoorrkk));; \\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement562( | ) . r u nt(iwde()t;i d )| , ^ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppe:a4d:s1(:n tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer ead s4) | ,I MtPiLd_ICnOBLlLo_cFkU(NtCh(rAelaldRIeddxu.cxe),, CgOrLoLuNpE(Tg_rDoIuRpE)C,T , | S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I M P| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)E , PreM u563l | S u m , sitnetp8S_itz)e ( n| c^c lSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnThre:a562d:s15G:a twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]e r, direct->up, N562U | L L , atrigds(-t>isde)n,d bnutfhfr,e aadrsg(sn-t>hrreecavdbsu)f,f ,t i d| I ^n Block(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. x), g202r | o u p ( g r o u pR)u,n W o| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k E l| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m ent.(c)o.mrmu.nb(uwfef)S;i z e| s ^[ NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppR:O4T:O1_:S Inote: Min instantiation of member function 'RunWork, 2, 2>::run' requested hereP LE]/ N4C | CILM_PSLT_ECPOSL/Ls_iFzUeNoCf((ATl)l)R e{d u c| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, C| O group(groupL LNET_DIRECT, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:L666E:,9 :P rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereM ulSum ,666 | i n t 8 _ t ) p| r^i ms(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391,: 95n:T hnote: rexpanded from macro 'IMPL_COLL_FUNC'e adsGath e391r | , dRiurneWcotr-k> steynpdeb,u fFfu,n ca#r#gdse-v>rreedcovpb ,| ^N CCL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_202#:#53a:l gnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, NCC L202_ | P R O T O _ # # pRruontWoo>r(k)E.lreumne(n&tn:(562):.15r:u nnote: (field 'nthreads' will be initialized after field 'tidInBlock'w e); | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppi:d5(:t1i:d )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here nthr e5a | dIsM(PnLt_hCrOeLaLd_sF)U,N Ct(iAdlIlnRBeldouccke(,t hCrOeLaLdNIEdTx_.DxI)R,E CgTr,o uSpI(MgPrLoEu,p )P,r e M| u ^~~~~~~~~~~~~~~~~l Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:,562 :u60i:n tnote: 8field 'group' will be initialized after field 'stepSize'_ t) | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95(:t inote: dexpanded from macro 'IMPL_COLL_FUNC') , nthre a391d | s ( nRtuhnrWeoardks<)n,c ctliFduInncB#l#ofcukn(ct,h rteyapdeI,d xF.uxn)c,# #gdreovurpe(dgorpo ,| ^~~~~~~~~~~N CCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i15d:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]n threads(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~r ea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x60.:x )note: ,field 'group' will be initialized after field 'stepSize' group(group) ,562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d (tid), 563n | t h r e asdtse(pnStihzree(andcsc)l,S htmiedmI.ncBolmomc.kb(utfhfrSeiazdeIsd[xN.CxC)L,_ PgRrOoTuOp_(SgIrMoPuLpE)],/ N C| C ^~~~~~~~~~~L _STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.halgo,: 562N:C15C:L _warning: Pinitializer order does not match the declaration order [-Wreorder-ctor]R OTO_##proto >562( | ) . r u nt(i&dn(ctcildS)h,m enmt.hwroerakd)s;( n\t h r| e ^a ds), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I15n:B lnote: ofield 'nthreads' will be initialized after field 'tidInBlock'c k(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ), ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~[ NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R60O:T Onote: _field 'group' will be initialized after field 'stepSize'S IMP L562E | ] / N C CtLi_dS(TtEiPdS)/,s inztehorfe(aTd)s)( n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ) group(group, tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r677e:a11d:I dnote: xin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. x), gr o677u | p ( g r o u p ) , p| r ^~~~~~~~~~~i ms(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h563: | 562 : 15 : swarning: tinitializer order does not match the declaration order [-Wreorder-ctor]e pSize(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here563 | s626t | e p S i z e ( n cpcrliSmhsm(etmi.dc-otmimd.SbtuafrftSSiczaetst[eNrC,C Ln_TPhRrOeTaOd_sSSIcMaPtLtEe]r/,N CNCULL_LS,T EdPiSr/escitz-e>oufp(,T )a)r g{s - >| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e n d| b group(groupu ff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:-677>:r11e:c vnote: bin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu ff, | 677 ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53p:r inote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (tid -202t | i d S t a r t B cRausntW,o rnkTEhlreemaednstBAolugto,, dPirroetcot>-(>)d.orwunn,( waer)g;s - >| s ^e ndbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :a6r:g1s:- >note: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree cvbu f6f | ,I M P| L ^_ COLL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202(:A53l:l Rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered uce, 202C | O L L N E T _ D IRRuEnCWTo,r kSEIlMePmLeEn,t ()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:r391u:n95(:w enote: )expanded from macro 'IMPL_COLL_FUNC'; | ^ 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppu:n4W:o1r:k , 2, 2>::run' requested herec clFu n4c | #I#MfPuLn_cC,O LtLy_pFeU,N CF(uAnlcl#R#eddeuvcree,d oCpO_,D INRCECCLT_,A LSGIOM_P#L#Ea,l gPor,e MNuClCSLu_mP,R OiTnOt_8#_#tp)r o t| o^> ().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n391(:&95n:c cnote: lexpanded from macro 'IMPL_COLL_FUNC'S hmem.w o391r | k ) ;R u\n W o| r ^k (,n tNhCrCeLa_dAsL)G,O _t#i#daIlngBol,o cNkC(CtLh_rPeRaOdTIOd_x#.#xp)r,o tgor>o(u)p.(rgurno(u&pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:)60;: \note: field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dnote: (field 'nthreads' will be initialized after field 'tidInBlock't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,562 | N C C L _tAiLdG(Ot_i#d#)a,l gnot,h rNeCaCdLs_(PnRtOhTrOe_a#d#sp)r,o ttoi>d(I)n.Brluonc(k&(ntchcrleSahdmIedmx..wxo)r,k )g;r o\u p (| g ^r oup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 15 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 group(group: 60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 687 : 11t:i dnote: (in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id), n687t | h r e a d s ( n t h rperaidmss)(,t itdi-dtIindBSltoacrkt(Btcharseta,d IndTxh.rxe)a,d sgBrcoauspt(,g r&oduipr)e,c t -| > ^~~~~~~~~~~o ut, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::39115::95 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: expanded from macro 'IMPL_COLL_FUNC' 391562 | | R u ntWiodr(ktI,d xN.CxC)L,_ AgLrGoOu_p#(#garloguop,) ,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ P R| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T O_##p r563o | t o > ( )s.treupnS(i&znec(cnlcSchlmSehmm.ewmo.rcko)m;m .\b u f| f ^S izes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_SIM P562L | E ] / N CtCiLd_(StTiEdP)S,/ snitzheroefa(dTs)()n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s )| , group(group tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:(687t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI dx.x), 687g | r o u p ( g r o u p )p,r i m| s ^~~~~~~~~~~~~~~~~( tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:S60t:a rnote: tfield 'group' will be initialized after field 'stepSize'B cast, 562n | T h r e atdisdB(ctaisdt),, &ndtihrreecatd-s>(onutth,r enaudlsl)p,t rt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBl 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement562(:)15.:r uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]( we); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp : 4t:i1d:( tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered ), nt h4r | eIaMdPsL(_nCtOhLrLe_aFdUsN)C,( AtlildRIendBulcoec,k (CtOhLrLeNaEdTI_dDxI.RxE)C,T ,g rSoIuMpP(LgEr,o uPpr)e,M u l| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u m ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i nt8_t )563 | | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i391z:e95(:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Shmem. c391o | m m .RbuunfWfoSrikz), {N C C| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ A L| G group(groupO _##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C641C:L11_:P Rnote: Oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_##pro t641o | > ( ) . r u n ( & n cpcrliSmhsm(etmi.dw-otrikd)S;t a\r t R| e ^d uce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:T562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock'R educe ,562 | d i r e ctti-d>(dtoiwdn),, &ndtihrreecatd-s>(onutth,r eaardgss)-,> steinddIbnuBflfo,c ka(rtghsr-e>ardeIcdvxb.uxf)f,, g r| o ^u p(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 : 53| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60 :202 | note: field 'group' will be initialized after field 'stepSize' 562R | u n W o rtkiEdl(etmiedn)t,< Fnnt,h rTe,a dRse(dnOtph,r eAaldgso),, PtriodtIon>B(l)o.crku(nt(hwree)a;d I d| x ^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppo:u6p:(1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ), | ^~~~~~~~~~~6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:E562T:_15D:I Rwarning: Einitializer order does not match the declaration order [-Wreorder-ctor]C T, SIM P562L | E , P rteiMdu(ltSiudm),, inntth3r2e_atd)s ( n| t^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)391,: 95t:i dnote: Iexpanded from macro 'IMPL_COLL_FUNC'n Block( t391h | r e aRduIndWxo.rxk)<,n cgcrloFuupn(cg#r#ofuupn)c,, t| y ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p e ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)F unc# #563d | e v r e dsotpe(,n cNcClCSLh_mAeLmG.Oc_o#m#ma.lbguof,f SNiCzCeLs_[PNRCOCTLO__P#R#OpTrOo_tSoI>M(P)L.Er]u/nN(C&CnLc_cSlTSEhPmSe/ms.iwzoerokf)(;T )\) {| ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hfield 'nthreads' will be initialized after field 'tidInBlock': 641:11: 562note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here t i641d | ( t i d ) , n t h rperaidmss((nttihdr-etaiddsS)t,a rttiRdeIdnuBcleo,c kn(TthhrreeaaddsIRdexd.uxc)e,, gdrioruepc(tg-r>oduopw)n,, &| d ^~~~~~~~~~~~~~~~~i rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:-562>:o60u:t ,note: field 'group' will be initialized after field 'stepSize'a rgs- >562s | e n d b utfifd,( tairdg)s,- >nrtehcrvebaudfsf(,n t h| r ^e ads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202I:n53B:l onote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek (thr e202a | d I d x . x ) , RgurnoWuopr(kgErloeumpe)n,t < F| n ^~~~~~~~~~~, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), nthrea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~~~~~~~x .x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g60r:o unote: pfield 'group' will be initialized after field 'stepSize'( grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), n563t | h r e a dsst(enptShirzeea(dnsc)c,l SthimdeImn.Bcloomcmk.(btuhfrfeSaidzIedsx[.NxC)C,L _gPrRoOuTpO(_gSrIoMuPpL)E,] / N| C ^~~~~~~~~~~C L_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rect->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,: 562N:C15C:L _warning: Ainitializer order does not match the declaration order [-Wreorder-ctor]L GO_##algo, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdkI)n;B l\o c k| ( ^t hreadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), 563n | t h r e asdtse(pnStihzree(andcsc)l,S htmiedmI.ncBolmomc.kb(utfhfrSeiazdeIsd[xN.CxC)L,_ PgRrOoTuOp_(SgIrMoPuLpE)],/ N C| C ^~~~~~~~~~~~~~~~~L _STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:/562s:i60z:e onote: ffield 'group' will be initialized after field 'stepSize'( T)) { 562| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d677s:(11n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads), 677t | i d I n B l o c k ( tphrriemasd(Itdixd.-xt)i,d SgtraorutpB(cgarsotu,p )n,T h r| e ^~~~~~~~~~~a dsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hinitializer order does not match the declaration order [-Wreorder-ctor]: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562563: | 60 : note: field 'group' will be initialized after field 'stepSize's tepSiz e562( | n c c l Sthimde(mt.icdo)m,m .nbtuhfrfeSaidzse(sn[tNhCrCeLa_dPsR)O,T Ot_iSdIIMnPBLlEo]c/kN(CtChLr_eSaTdEIPdSx/.sxi)z,e ogfr(oTu)p)( g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | group(group| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_ALG:O562_:#15#:a lwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]o , NCCL_PROTO_ #562# | p r o t ot>i(d)(.triudn)(,& nnctchlrSehamdesm(.nwtohrrke)a;d s\) , | t ^i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a dIdx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( nthre a563d | s ) , tsitdeIpnSBilzoec(kn(ctchlrSehamdeImd.xc.oxm)m,. bgurfofuSpi(zgerso[uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~O TO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L60E:] /note: Nfield 'group' will be initialized after field 'stepSize'C CL_ST E562P | S / s i zteiodf((tTi)d)) ,{ n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups (nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:B655l:o11c:k (note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh readId x655. | x ) , g r o u p ( gprroiumps)(,t i d| - ^~~~~~~~~~~t idStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,warning: initializer order does not match the declaration order [-Wreorder-ctor]N CCL_ALGO_##a l562g | o , N CtCiLd_(PtRiOdT)O,_ #n#tphrroetaod>s(()n.trhurne(a&dnsc)c,l SthimdeImn.Bwloorckk)(;t h\r e a| d ^I dx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock'( gro u562p | ) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d), nth r563e | a d s ( nsttherpeSaidzse)(,n ctcildSIhnmBelmo.ccko(mtmh.rbeuafdfISdixz.exs)[,N CgCrLo_uPpR(OgTrOo_uSpI)M,P L E| ] ^~~~~~~~~~~~~~~~~/ NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:S60T:E Pnote: Sfield 'group' will be initialized after field 'stepSize'/ size o562f | ( T ) ) t{i d (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d )| , group(group nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r626e:a9d:s )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidInBl o626c | k ( t h r e a d Ipdrxi.mxs)(,t igdr-otuipd(SgtraorutpS)c,a t t| e ^~~~~~~~~~~r , nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pwarning: )initializer order does not match the declaration order [-Wreorder-ctor], | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL)E,] /gNrCoCuLp_(SgTrEoPuSp/)s,i z e| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f ( T| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~563 | | group(group stepSize(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:c666c:l9S:h mnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem .com m666. | b u f f S i z e sp[rNiCmCsL(_tPiRdO,T On_TShIrMePaLdEs]G/aNtChCeLr_,S TdEiPrSe/csti-z>euopf,( TN)U)L L{, a| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g s -| > group(groups endbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:r687g:s11-:> rnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec vbuff, 687 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :p202r:i53m:s (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d-ti d202S | t a r t B c a s tR,u nnWTohrrkeEaldesmBecnatspout, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562i:n15t:6 4warning: _initializer order does not match the declaration order [-Wreorder-ctor]t ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 391 : 95 : tnote: iexpanded from macro 'IMPL_COLL_FUNC'd (tid), 391n | t h rReuandWso(rnkto,u pN)C,C L _| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L G O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #algo ,563 | N C C L _sPtReOpTSOi_z#e#(pnrcoctloS>h(m)e.mr.ucno(m&mn.cbculfSfhSmiezme.sw[oNrCkC)L;_ P\R O T| O ^_ SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:]562/:N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'S TEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e687a:d11s:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret idInBl o687c | k ( t h r e a d I d xp.rxi)m,s (gtriodu-pt(igdrSotuapr)t,B c a| s ^~~~~~~~~~~~~~~~~t , nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s Bnote: cfield 'group' will be initialized after field 'stepSize'a st, & d562i | r e c t -t>iodu(tt,i dn)u,l lnptthrr,e aadrsg(sn-t>hsreenaddbsu)f,f ,t iadrIgnsB-l>orcekc(vtbhurfefa,d I d| x ^. x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:g202r:o53u:p )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]n Block(thre a562d | I d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s (| n ^~~~~~~~~~~~~~~~~t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562):,60 :t inote: dfield 'group' will be initialized after field 'stepSize'I nBlock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), tidI n563B | l o c k (sttherpeSaidzIed(xn.cxc)l,S hgmreomu.pc(ogmrmo.ubpu)f,f S i| z ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562d:15: unote: cfield 'nthreads' will be initialized after field 'tidInBlock'e , COL L562NET_DI | R E C T ,t iSdI(MtPiLdE),, PnrtehMruelaSdusm(,n tuhirneta3d2s_)t,) t i| d^I nBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:I dnote: xexpanded from macro 'IMPL_COLL_FUNC'. x), gr o391u | p ( gRruonuWpo)r,k < n| c ^~~~~~~~~~~~~~~~~c lFun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:#562#:f60u:n cnote: ,field 'group' will be initialized after field 'stepSize' type, 562F | u n c # #tdiedv(rteiddo)p,< tnytpher>e,a dNsC(CnLt_hArLeGaOd_s#)#,a ltgiod,I nNBClCoLc_kP(RtOhTrOe_a#d#Ipdrxo.txo)>,( )g.rrouunp((&gnrcoculpS)h,m e m| . ^~~~~~~~~~~w ork); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562 | :t562i:d15(:t iwarning: d)initializer order does not match the declaration order [-Wreorder-ctor], nthreads(nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~k (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nul/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:p562t:r15,: &warning: dinitializer order does not match the declaration order [-Wreorder-ctor]i rect-> o562u | t , a rtgisd-(>tsiedn)d,b unftfh,r eaardgss(-n>trhercevabdusf)f,, t i| d ^I nBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d Inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested herex .x), 202g | r o u p ( g r o uRpu)n,W o r| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E l e| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e nt.(c)o.mrmu.nb(uwfef)S;i z e| s ^[ NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppR:O8T:O1_:S Inote: Min instantiation of member function 'RunWork, 2, 2>::run' requested hereP LE]/ N8C | CILM_PSLT_ECPOSL/Ls_iFzUeNoCf((ATl)l)R e{d u c| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, C| O group(groupL LNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:C666T:,9 :S Inote: Min instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP LE, P666r | e M u l S u m , pirnitm6s4(_tti)d , | n^T hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391G:a95t:h enote: rexpanded from macro 'IMPL_COLL_FUNC', dire c391t | - > uRpu,n WNoUrLkL<,n cacrlgFsu-n>cs#e#nfdubnucf,f ,t yapreg,s -F>urnecc#v#bduefvfr,e d o| p ^< type>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :N202C:C53L:_ Anote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereG O_# #202a | l g o , N C C LR_uPnRWOoTrOk_E#l#epmreontto<>F(n),. rTu,n (R&endcOcpl,S hAmlegmo.,w oPrrko)t;o >\( ) .| r ^u n(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562 :| 15 ^: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6 :5621 | : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid (6t | iIdM)P,L _nCtOhLrLe_aFdUsN(Cn(tAhlrleRaeddsu)c,e ,t iCdOILnLBNlEoTc_kD(ItRhErCeTa,d ISdIxM.PxL)E,, gPrroeuMpu(lgSruomu,p )i,n t 3| 2 ^~~~~~~~~~~~~~~~~_ t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :field 'group' will be initialized after field 'stepSize'391 :95: note: 562expanded from macro 'IMPL_COLL_FUNC' | t i391d | ( t iRdu)n,W onrtkhg,r oNuCpC(Lg_rAoLuGpO)_,# # a| l ^~~~~~~~~~~g o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562 | : 562t:i15: warning: initializer order does not match the declaration order [-Wreorder-ctor] d(tid), nthrea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~~~~~~~x .x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o60u:p (note: gfield 'group' will be initialized after field 'stepSize'r oup), 562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d(tid )563, | n t h rsetaedpsS(inzteh(rnecacdlsS)h,m etmi.dcIonmBml.obcukf(ftShirzeeasd[INdCxC.Lx_)P,R OgTrOo_uSpI(MgPrLoEu]p/)N,C C L| _ ^~~~~~~~~~~S TEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ llReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork15,: Nwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]C L_ALGO_##a l562g | o , N CtCiLd_(PtRiOdT)O,_ #n#tphrroetaod>s(()n.trhurne(a&dnsc)c,l SthimdeImn.Bwloorckk)(;t h\r e a| d ^I dx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r onote: ufield 'nthreads' will be initialized after field 'tidInBlock'p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(t i563d | ) , n tshtreepaSdisz(en(tnhcrcelaSdhsm)e,m .tciodmImn.BbluofcfkS(itzherse[aNdCICdLx_.PxR)O,T Og_rSoIuMpP(LgEr]o/uNpC)C,L _ S| T ^~~~~~~~~~~~~~~~~E PS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:i562z:e60o:f (note: Tfield 'group' will be initialized after field 'stepSize') ) { 562| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n677t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (nthre a677d | s ) , t i d I n B lporcikm(st(htrieda-dtIiddxS.txa)r,t Bgcraosutp,( gnrTohurpe)a,d s B| c ^~~~~~~~~~~a st, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | toup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,562 | N C C L _tAiLdG(Ot_i#d#)a,l gnot,h rNeCaCdLs_(PnRtOhTrOe_a#d#sp)r,o ttoi>d(I)n.Brluonc(k&(ntchcrleSahdmIedmx..wxo)r,k )g;r o\u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 15 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s t etpiSdi(ztei(dn)c,c lnSthhmreema.dcso(mnmt.hbruefafdSsi)z,e st[iNdCICnLB_lPoRcOkT(Ot_hSrIeMaPdLIEd]x/.NxC)C,L _gSrToEuPpS(/gsriozuepo)f,( T )| ) ^~~~~~~~~~~~~~~~~ { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 60 group(group: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655t:i11d:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nt h655r | e a d s ( n t h r e apdrsi)m,s (ttiiddI-ntBildoSctka(rtthRreedaudcIed,x .nxT)h,r egardosuRpe(dgurcoeu,p )n,u l l| p ^~~~~~~~~~~t r, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThirde(atdisdB)c,a sntt,h r&edaidrse(cntt-h>roeuatd,s )d,i rteicdtI-n>Bdloowcnk,( tahrrgesa-d>Isdexn.dxb)u,f fg,r oaurpg(sg-r>oruepc)v,b u f| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56353 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here step S202i | z e ( n c c l S hRmuenmW.ocrokmEml.ebmuefnftSC(L)_.SrTuEnP(Sw/es)i;z e o| f ^( T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :| 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 1 :| group(groupnote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:L655L:_11F:U Nnote: Cin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( AllRed u655c | e , C O L L N E T _pDrIiRmEsC(Tt,i dS-ItMiPdLSEt,a rPtrReeMduulcSeu,m ,n Tuhirneta3d2s_Rte)d u c| e^, nul/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:p391t:r95,: ¬e: dexpanded from macro 'IMPL_COLL_FUNC'i rect->o u391t | , aRrugnsW-o>rskecr,e ctvybpuef,f ,F u n| c ^# #devredo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:<202t:y53p:e >note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here NCCL _202A | L G O _ # # a l gRou,n WNoCrCkLE_lPeRmOeTnOt_<#F#np,r oTt,o >R(e)d.Orpu,n (A&lngcoc,l SPhrmoetmo.>w(o)r.kr)u;n (\w e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :note: 9field 'nthreads' will be initialized after field 'tidInBlock': 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 9 | tIiMdP(Lt_iCdO)L,L _nFtUhNrCe(aAdlsl(Rnetdhurceea,d sC)O,L LtNiEdTI_nDBIlRoEcCkT(,t hSrIeMaPdLIEd,x .Pxr)e,M uglrSouump,( gurionutp6)4,_ t )| ^~~~~~~~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 391note: :field 'group' will be initialized after field 'stepSize'95 : note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391t | i d (RtuindW)o,r kn,, gNrCoCuLp_(AgLrGoOu_p#)#,a l g| o ^~~~~~~~~~~, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.htid):,562 :n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds(nthreads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock't id(ti d562) | , n t htrieda(dtsi(dn)t,h rntehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), | 563 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t60e:p Snote: ifield 'group' will be initialized after field 'stepSize'z e(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]T O_SIMPL E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o687c:k11(:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree adIdx .687x | ) , g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~- t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S tartB c563a | s t , nsTtherpeSaidzseB(cnacsctl,S h&mdeimr.eccotm-m>.obuutf,f Sniuzlelsp[tNrC,C La_rPgRsO-T>Os_eSnIdMbPuLfEf],/ NaCrCgLs_-S>TrEePcSv/bsuifzfe,o f (| T ^) ) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 202 :| 53 group(group: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :202687 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Run W687o | r k E l e m e n t sr(te),a. drnsuT)nh,(r weteai)dd;sI Bn cB| al ^so tc,k (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp&t:dh6ir:re1ea:cd tInote: -din instantiation of member function 'RunWork, 2, 2>::run' requested here>x o.u xt6), | , I nMguPrlLol_upCptO(rL,gL r_aoFruUgpNs)C-,(> As le| ln ^~~~~~~~~~~~~~~~~Rd ebduufcfe,, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :aC562rO:gL60sL:-N >Enote: rTfield 'group' will be initialized after field 'stepSize'e_ cDvIbRuE fC562fT | ,, S | I ^Mt PiLdE(,t iPdr)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he,:M 202unl:tSh53ur:me ,anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereis n(tn3t2 h_202rt | e) a d s| )^ , t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h i:Rd391uI:nn95WB:ol ronote: kcexpanded from macro 'IMPL_COLL_FUNC'Ek l(etm he391rn | et a< dFRInud,nx W.Tox,r) k,R< engdcrOcoplu,Fp u(Anglcrg#oo#u,fp u)Pn,rc o, t | ot ^~~~~~~~~~~>y (p)e.,r uFnu(nwce#)#;d e v| r ^e dop, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppN:C8C:L1_:A Lnote: Gin instantiation of member function 'RunWork, 2, 2>::run' requested hereO _##alg o8, | INMCPCLL__CPORLOLT_OF_U#N#Cp(rAoltloR>e(d)u.creu,n (C&OnLcLcNlESTh_mDeImR.EwCoTr,k )S;I M\P L E| , ^ PreM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:l562S:u15m:, note: ifield 'nthreads' will be initialized after field 'tidInBlock'n t64_ t562) | | ^ tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :n95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a ds(nthre a391d | s ) ,R utniWdoIrnkB:,60 :N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _ALG O562_ | # # a l gtoi,d (NtCiCdL)_,P RnOtThOr_e#a#dpsr(onttoh>r(e)a.drsu)n,( &tnicdcIlnSBhlmoecmk.(wtohrrke)a;d I\d x .| x ^) , grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pnote: )field 'nthreads' will be initialized after field 'tidInBlock', | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562_:C15O:L Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]F UNC(AllRe d562u | c e , CtOiLdL(NtEiTd_)D,I RnEtChTr,e aSdIsM(PnLtEh,r ePardesM)u,l Stuimd,I nuBilnotc3k2(_tth)r e a| d^I dx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC'( group) ,391 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R u n| W tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rk[,N CNCCLC_LP_RAOLTGOO__S#I#MaPlLgEo],/ NNCCCCLL__SPTREOPTSO/_s#i#zperooft(oT>)()) .{r u n| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~& n c| c group(groupl Shmem.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:r641k:)11;: \note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h641: | 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' pr i562m | s ( t i dt-itdi(dtSitda)r,t Rnetdhurceea,d sn(TnhtrheraedasdRse)d,u ctei,d IdniBrleocctk-(>tdhorwena,d I&ddxi.rxe)c,t -g>roouutp,( garrogusp-)>,s e n| d ^~~~~~~~~~~~~~~~~b uf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 :a60r:g snote: -field 'group' will be initialized after field 'stepSize'> recv b562u | f f , t| i ^d (tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (nth r202e | a d s ) , t i dRIunnBWloorckkE(ltehmreenatd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdsRed:u562c:e15,: nwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]l lptr, &di r562e | c t - > otuitd,( tairdg)s,- >nstehnrdebaudfsf(,n tahrrgesa-d>sr)e,c vtbiudfIfn,B l o| c ^k (thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202I:d53x:. xnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, gr o202u | p ( g r o u p ) ,R u n| W ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o r k| E tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ement <563F | n , T ,s tReepdSOipz,e (Anlcgcol,S hPmreomt.oc>o(m)m..rbuunf(fwSei)z;e s [| N ^C CL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp_:P8R:O1T:O _note: Sin instantiation of member function 'RunWork, 2, 2>::run' requested hereI MPL E8] | /INMCPCLL__CSOTLELP_SF/UsNiCz(eAolfl(RTe)d)u c{e , | C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O L L| N group(groupE T_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:,626 :P9r:e Mnote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel Sum, i n626t | 6 4 _ t ) | ^p rim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(391t:i95d:- tnote: iexpanded from macro 'IMPL_COLL_FUNC'd Star t391S | c a tRtuenrW,o rnkTduepv,r eadrogps<-t>yspeen>d,b uNfCfC,L _aArLgGsO-_>#r#eaclvgbou,f fN,C C L| _ ^P ROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:t202o:>53(:) .note: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu n(&nc c202l | S h m e m . w o rRku)n;W o\r k E| l ^e men/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:<562F:n15,: Tnote: ,field 'nthreads' will be initialized after field 'tidInBlock' RedO p562, | A l g ot,i dP(rtoitdo)>,( )n.trhurne(awdes)(;n t h| r ^e ads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppi:d7I:n1B:l onote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herek (thre a7d | IIdMxP.Lx_)C,O LgLr_oFuUpN(Cg(rAolulpR)e,d u c| e ^~~~~~~~~~~~~~~~~, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:L562L:N60E:T _note: Dfield 'group' will be initialized after field 'stepSize'I REC T562, | S I M PtLiEd,( tPirde)M, nthreadusl(Snutmh,r euaidnst)3,2 _tti)d I n| B^l ock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d Inote: dexpanded from macro 'IMPL_COLL_FUNC'x .x), gr o391u | p ( gRruonuWpo)r,k < n| c ^~~~~~~~~~~c lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuffds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBlock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~b u f| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S izes [563N | C C L _ PsRtOeTpOS_iSzIeM(PnLcEc]l/SNhCmCeLm_.ScToEmPmS./bsuifzfeSoifz(eTs)[)N C{C L _| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R O T| O group(group_ SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/:N655C:C11L:_ Snote: Tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereE PS/siz e655o | f ( T ) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupp rims(tid-t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d666S:t9a:r tnote: Rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree duce ,666 | n T h r e a d s Rperdiumcse(,t indu,l lnpTthrr,e a&ddsiGraetchte-r>,o udti,r eacrtg-s>-u>ps,e nNdUbLuLf,f ,a ragrsg-s>-s>ernedcbvubfuff,f ,a r g| s ^- >recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :20253 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202R | u n W o r k E l eRmuennWtog(o),. rPurno(twoe>)(;) . r| u ^n (we); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp ^: 7:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppnote: :in instantiation of member function 'RunWork, 2, 2>::run' requested here9 :1: note: 7in instantiation of member function 'RunWork, 2, 2>::run' requested here | IMPL _9C | OILMLP_LF_UCNOCL(LA_lFlURNeCd(uAclel,R eCdOuLcLeN,E TC_ODLILRNEECTT_,D ISRIEMCPTL,E ,S IPMrPeLMEu,l SPurme,M uuliSnutm3,2 _uti)n t 6| 4^_ t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| :^391 :95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hexpanded from macro 'IMPL_COLL_FUNC': 391:95: note: expanded from macro 'IMPL_COLL_FUNC'391 | Run W391o | r k , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, duirectp-)>,u p ,| ^~~~~~~~~~~~~~~~~N ULL, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:e562n:d60b:u fnote: ffield 'group' will be initialized after field 'stepSize', args->r e562c | v b u f ft,i d (| t ^i d), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:( nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh reads )202, | t i d I n B l oRcukn(WtohrrkeEaldeImdexn.tx<)F,n ,g rTo,u pR(egdrOopu,p )A,l g o| , ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>rec:v562b:u15f:f ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :562202 | : 53 : note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d(tid )202, | n t h r e a d sR(unntWhorrekaEdlse)m,e nttiu(p)(.grruonu(pw)e,) ; | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp : 8 : 1s:t enote: pin instantiation of member function 'RunWork, 2, 2>::run' requested hereS ize(n c8c | lISMhPmLe_mC.OcLoLm_mF.UbNuCf(fASlilzReesd[uNcCeC,L _CPORLOLTNOE_TS_IDMIPRLEEC]T/,N CSCILM_PSLTEE,P SP/rseiMzueloSfu(mT,) )i n{t 6 4| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t ) | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hexpanded from macro 'IMPL_COLL_FUNC': 666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | R666u | n W o r k < n c cplrFiumnsc(#t#ifdu,n cn,T htryepaed,s GFautnhce#r#,d edvirreedcotp-<>tuypp,e >N,U LNLC,C La_rAgLsG-O>_s#e#nadlbguof,f ,N CaCrLg_sP-R>OrTeOc_v#b#upfrfo,t o >| ( ^) .run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:c202l:S53h:m enote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. work )202; | \ | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562E:l15e:m enote: nfield 'nthreads' will be initialized after field 'tidInBlock't r(e)a.drsu(nn(twher)e;a d s| ) ^, tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppB:l9o:c1k:( tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer ead I9d | xI.MxP)L,_ CgOrLoLu_pF(UgNrCo(uApl)l,R e d| u ^~~~~~~~~~~~~~~~~c e, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:L562L:N60E:T _note: Dfield 'group' will be initialized after field 'stepSize'I RECT, S562I | M P L E ,t iPdr(etMiudl)S,u mn,t hurienatd6s4(_ntt)h r e| a^d s), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I391n:B95l:o cnote: kexpanded from macro 'IMPL_COLL_FUNC'( thread I391d | x . xR)u,n Wgorroku, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:C562O:L15L:_ Fwarning: Uinitializer order does not match the declaration order [-Wreorder-ctor]N C(AllRedu c562e | , C O LtLiNdE(Tt_iDdI)R,E CnTt,h rSeIaMdPsL(En,t hPrreeaMdusl)S,u mt,i diInntB6l4o_ctk)( t h| r^e adId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.391x:)95,: gnote: rexpanded from macro 'IMPL_COLL_FUNC'o up(gro u391p | ) , R u| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~W o r| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)< ncclF u563n | c # # f usntce,p Stiyzpee(,n cFculnSch#m#edme.vcroemdmo.pbz,e sN[CNCCLC_LA_LPGROO_T#O#_aSlIgMoP,L EN]C/CNLC_CPLR_OSTTOE_P#S#/psriozteoo>f(()T.)r)u n{( & n| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c l S| h group(groupm em.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h\: 687 :| 11 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nt:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds(nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o c k| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread I563d | x . x ) ,s tgerpoSuipz(eg(rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~. comm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:s60[:N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I626d:x9.:x )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here group( g626r | o u p ) , | ^~~~~~~~~~~p rims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 15 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: warning: | initializer order does not match the declaration order [-Wreorder-ctor] tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 562 | s t e p Stiizde((tnicdc)l,S hnmtehmr.ecaodmsm(.nbtuhfrfeSaidzse)s,[ NtCiCdLI_nPBRlOoTcOk_(StIhMrPeLaEd]I/dNxC.CxL)_,S TgErPoSu/ps(igzreooufp()T,) ) | { ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | group(group 563 | stepSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n677c:c11l:S hnote: min instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.com m677. | b u f f S i z e s [ NpCrCiLm_sP(RtOiTdO-_tSiIdMSPtLaEr]t/BNcCaCsLt_,S TnETPhSr/esaidzseBocfa(sTt),) &{d i r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c t -| > group(groupo ut, dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:t641-:>11d:o wnote: nin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, args- >641s | e n d b u f f , a rpgrsi-m>sr(etcivdb-utfifd,S t a| r ^t Reduce, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:h202r:e53a:d snote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree duce ,202 | d i r e c t - > dRouwnnW,o r&kdEilreemcetn-t>Ospe,n dAblugfof,, Parrogtso->>(r)e.crvubnu(fwfe,) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp202::1053::1 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 202 | 10 | I M P L _ C ORLuLn_WFoUrNkCE(lAelmleRnetdE(,) .PrruenM(uwleS)u;m , | h ^a lf) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp^: 10:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'RunWork, 2, 2>::run' requested here391 :95: note: 10expanded from macro 'IMPL_COLL_FUNC' | IMPL_C O391L | L _ FRUuNnCW(oArlkl ,h aNlCfC)L _ A| L^G O_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:a391l:g95o:, note: Nexpanded from macro 'IMPL_COLL_FUNC'C CL_PR O391T | O _ #R#upnrWootrok><(n)c.crluFnu(n&cn#c#cfluSnhcm,e mt.ywpoer,k )F;u n\c # #| d ^e vredop:, note: Nfield 'nthreads' will be initialized after field 'tidInBlock'C CL_ALG O562_ | # # a l gtoi,d (NtCiCdL)_,P RnOtThOr_e#a#dpsr(onttoh>r(e)a.drsu)n,( &tnicdcIlnSBhlmoecmk.(wtohrrke)a;d I\d x .| x ^) , gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock') , | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i60d:( tnote: ifield 'group' will be initialized after field 'stepSize'd ), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~, gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g60r:o unote: pfield 'group' will be initialized after field 'stepSize') , | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562,: 15n:T hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e adsBcast ,562 | & d i r etcitd-(>toiudt),, dnitrherceta-d>sd(onwtnh,r eaardgss)-,> steinddIbnuBflfo,c ka(rtghsr-e>ardeIcdvxb.uxf)f,, g r| o ^u p(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | s t eRpuSniWzoer(knEclcelmSehnmteO(_)S.IrMuPnL(Ew]e/)N;C C L| _ ^S TEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppi:z10e:o1f:( Tnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here) { | 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | I M| P group(groupL _COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A626l:l9R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, COL L626N | E T _ D I R E C Tp,r iSmIsM(PtLiEd,- tPirdeSMtualrStuSmc,a thtaelrf,) n T| h^r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:c391a:t95t:e rnote: ,expanded from macro 'IMPL_COLL_FUNC' NULL, d391i | r e cRtu-n>Wuopr,k usnecn#d#bfuufnfc,, atrygpse-,> rFeucnvcb#u#fdfe,v r e| d ^o p:,202 :N53C:C Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereA LGO_ #202# | a l g o , N C CRLu_nPWRoOrTkOE_l#e#mpernottn(,) .Tr,u nR(e&dnOcpc,l SAhlmgeom,. wPorrokt)o;> (\) . r| u ^n (we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8: 1562: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid (8t | iIdM)P,L _nCtOhLrLe_aFdUsN(Cn(tAhlrleRaeddsu)c,e ,t iCdOILnLBNlEoTc_kD(ItRhErCeTa,d ISdIxM.PxL)E,, gPrroeuMpu(lgSruomu,p )i,n t 6| 4 ^~~~~~~~~~~~~~~~~_ t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95 :562 | note: expanded from macro 'IMPL_COLL_FUNC' tid( t391i | d ) ,R unntWhorrekau,p (NgCrCoLu_pA)L,G O _| # ^~~~~~~~~~~# algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:r562e:M15u:l Swarning: uinitializer order does not match the declaration order [-Wreorder-ctor]m , int64 _562t | ) | ^t id(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391):,95 :n tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eads(n t391h | r e aRdusn)W,o rtki| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) NCCL _563A | L G O _ #s#taelpgSoi,z eN(CnCcLc_lPSRhOmTeOm_.#c#opmrmo.tbou>f(f)S.irzuens([&NnCcCcLl_SPhRmOeTmO._wSoIrMkP)L;E ]\/ N C| C ^L _STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:s562i:z15e:o fnote: (field 'nthreads' will be initialized after field 'tidInBlock'T )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 | | group(group tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:)666,: 9n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads( n666t | h r e a d s ) , ptriidmIsn(Btliodc,k (ntThhrreeaaddIsdGxa.txh)e,r ,g rdoiurpe(cgtr-o>uupp),, N U| L ^~~~~~~~~~~~~~~~~L , arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:s60e:n dnote: bfield 'group' will be initialized after field 'stepSize'u ff, ar g562s | - > r e ctvibdu(ftfi,d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202(:n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~o , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: warning: initializer order does not match the declaration order [-Wreorder-ctor]: 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 563| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) st e563p | S i z e (sntcecplSSihzmee(mn.cccolmSmh.mbeumf.fcSoimzme.sb[uNfCfCSLi_zPeRsO[TNOC_CSLI_MPPRLOET]O/_NSCICMLP_LSET]E/PNSC/CsLi_zSeToEfP(ST/)s)i z{e o f| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T ) )| group(group{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here641 | 687 | p r i m s ( t ipdr-itmisd(Sttiadr-ttRieddSutcaer,t BncTahsrte,a dnsTRherdeuacdes,B cdaisrte,c t&-d>idroewcnt,- >&oduitr,e cntu-l>loputtr,, aarrggss-->>sseennddbbuuffff,, aarrggss-->>rreeccvvbbuuffff,, | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202::20253::53 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R uRnuWnoWrokrEklEelmeemnetno(>)(.)r.urnu(nw(ew)e;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp10::81::1 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 8I | MIPMLP_LC_OCLOLL_LF_UFNUCN(CA(lAllRleRdeudcuec,e ,C OCLOLLNLENTE_TD_IDRIERCETC,T ,S ISMIPMLPEL,E ,P rPerMeuMluSluSmu,m ,h ailnft)6 4 _| t^) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391expanded from macro 'IMPL_COLL_FUNC': 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork< n391c | c l FRuunncW#o#rfkue,v rNeCdCoLp_#,# aNlCgCoL,_ ANLCGCOL__#P#RaOlTgOo_,# #NpCrCoLt_oP>R(O)T.Or_u#n#(p&rnoctcol>S(h)m.ermu.nw(o&rnkc)c;l S\h m e| m ^. work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562\: 15 :| ^note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~u p(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)60,: note: | field 'group' will be initialized after field 'stepSize' ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 60 : note: field 'group' will be initialized after field 'stepSize't id(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~( group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562o:f15(:T )warning: )initializer order does not match the declaration order [-Wreorder-ctor] { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t626i:d9):, note: nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hread s626( | n t h r e a d s )p,r itmisd(ItniBdl-otcikd(SttharretaSdcIadtxt.exr),, ngTrhoruepa(dgsrSocuapt)t,e r ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N U L| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), dire c563t | - > u p ,s taerpgSsi-z>es(enncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 15g:r owarning: uinitializer order does not match the declaration order [-Wreorder-ctor]p (group), | 562 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(gro u563p | ) , | s ^~~~~~~~~~~t epSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~563 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ste p563S | i z e ( nsctcelpSShimzeem(.nccocmlmS.hbmuefmf.Sciozmems.[bNuCfCfLS_iPzReOsT[ON_CSCILM_PPLREO]T/ON_CSCILM_PSLTEE]P/SN/CsCiLz_eSoTfE(PTS)/)s i{z e o| f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~(: T562 ):| )15 group(group : { warning: initializer order does not match the declaration order [-Wreorder-ctor]| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 626 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: :9t655:i: d11note: (:in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret inote: din instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , 626n | t h655 r | e a d s ( n pt rh ir me sap(drtsii)md,s- (ttidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock': 562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~u p(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)60,: note: | field 'group' will be initialized after field 'stepSize' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d ( tsitde)p,S inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBelso[cNkC(CtLh_rPeRaOdTIOd_xS.IxM)P,L Eg]r/oNuCpC(Lg_rSoTuEpP)S,/ s i| z ^~~~~~~~~~~e of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkr(e)d.orpuc,l SNhCmCeLm_.AwLoGrOk_)#;# a\l g o| , ^ NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>15(:) .note: rfield 'nthreads' will be initialized after field 'tidInBlock'u n(&ncclShm e562m | . w o r kt)i;d (\t i d| ) ^, nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's ), ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~I nBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd Idx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~h re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>15(:) .warning: rinitializer order does not match the declaration order [-Wreorder-ctor]u n(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's ), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock (563t | h r e a dsItdexp.Sxi)z,e (gnrcoculpS(hgmreomu.pc)o,m m .| b ^~~~~~~~~~~~~~~~~u ffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:s562[:N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'P ROTO_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:c687k:(11t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea dIdx .687x | ) , g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~- tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s [ N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_PR O563T | O _ S I MsPtLeEp]S/iNzCeC(Ln_cScTlESPhSm/esmi.zceoomfm(.Tb)u)f f{S i z| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s [ N| C group(groupC L_PROTO_SIMPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_641S:T11E:P Snote: /in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres izeof( T641) | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupp rims(tid-ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:S666t:a9r:t Rnote: ein instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered uce, 666n | T h r e a d s R epdruicmes,( tdiidr,e cntT-h>rdeoawdns,G a&tdhierre,c td-i>roeuctt,- >aurpg,s -N>UsLeLn,d baurfgfs,- >asregnsd-b>urfefc,v baurfgfs,- > r| e ^c vbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^202 :53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 202:53: 202note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R u n W o r kREulneWmoernktE /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, NCC:L562_:A15L:G Owarning: _initializer order does not match the declaration order [-Wreorder-ctor]# #algo, NCCL _562P | R O T O _t#i#dp(rtoitdo)>,( )n.trhurne(a&dnsc(cnltShhrmeeamd.sw)o,r kt)i;d I\n B l| o ^c k(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I15d:x .note: xfield 'nthreads' will be initialized after field 'tidInBlock') , gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h read s563( | n t h r esatdesp)S,i ztei(dnIcncBllSohcmke(mt.hcroemamd.Ibduxf.fxS)i,z egsr[oNuCpC(Lg_rPoRuOpT)O,_ S I| M ^~~~~~~~~~~~~~~~~P LE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C60L:_ Snote: Tfield 'group' will be initialized after field 'stepSize'E PS/si z562e | o f ( T )t)i d{( t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | n group(groupt hreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s687):,11 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI nBloc k687( | t h r e a d I d x . xp)r,i mgsr(otuipd(-gtrioduSpt)a,r t B| c ^~~~~~~~~~~a st, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWo), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp):,562 : 15| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor]| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | group(group 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hz:e655(:n11c:c lnote: Sin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh mem.c o655m | m . b u f f S i z e sp[rNiCmCsL(_tPiRdO-TtOi_dSSItMaPrLtER]e/dNuCcCeL,_ SnTTEhPrSe/asdiszReeodfu(cTe),) n{u l l| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t r ,| group(group& direct->o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:t626,: 9a:r gnote: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >send b626u | f f , a r g s -p>rriemcsv(btuifdf-,t i d| S ^t artSca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:t202e:r53,: nnote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh rea d202s | S c a t t e r , RNuUnLWLo,r kdEilreemcetn-t>dsOepn,d bAulfgfo,, aPrrgost-o>>r(e)c.vrbuunf(fw,e ) ;| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp202::1053::1 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]p rims(tid, n562T | h r e a dtsiGda(tthiedr),, dnitrherceta-d>su(pn,t hNrUeLaLd,s )a,r gtsi-d>IsneBnldobcukf(ft,h raeragdsI-d>xr.exc)v,b ugfrfo,u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 53 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 563202 | | s t e p SRiuzneW(onrckcEllSehmmeenmt._(P)R.OrTuOn_(SwIeM)P;L E ]| / ^N CCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp/:s12i:z1e:o fnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested hereT )) { | 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | I M| P group(groupL _COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A641l:l11R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, CO L641L | N E T _ D I R E C T ,p rSiImMsP(LtEi,d -PtriedMSutlaSrutmR,e dduocueb,l en)T h r| e^a dsRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:c391e:,95 :d inote: rexpanded from macro 'IMPL_COLL_FUNC'e ct->dow n391, | & dRiurneWcotr-k>fsuenncd,b utfyfp,e ,a rFgusn-c>#r#edcevvbruefdfo,p < t| y ^p e>, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:A53L:G Onote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here# #alg o202, | N C C L _ P R ORTuOn_W#o#rpkrEolteom>e(n)t. ^( ).run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:e562):;15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :56211 | : 1 : note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herei d(t i11d | )I,M PnLt_hCrOeLaLd_sF(UnNtCh(rAelaldRse)d,u ctei,d ICnOBLlLoNcEkT(_tDhIrReEaCdTI,d xS.IxM)P,L Eg,r oPurpe(MgurloSuupm),, f l| o ^~~~~~~~~~~~~~~~~a t)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| :^60 : note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hfield 'group' will be initialized after field 'stepSize': 391:95: note: 562expanded from macro 'IMPL_COLL_FUNC' | t i391d | ( t iRunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562w:o15r:k )warning: ;initializer order does not match the declaration order [-Wreorder-ctor] \ | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :t inote: field 'nthreads' will be initialized after field 'tidInBlock'd (tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g roup )563, | | ^~~~~~~~~~~~~~~~~ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:S562i:z60e:( nnote: cfield 'group' will be initialized after field 'stepSize'c lShme m562. | c o m m .tbiudf(ftSiidz)e,s [nNtChCrLe_aPdRsO(TnOt_hSrIeMaPdLsE)],/ NtCiCdLI_nSBTlEoPcSk/(stihzreeoafd(ITd)x). x{) , | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group( group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_AL G562O | _ # # a ltgiod,( tNiCdC)L,_ PnRtOhTrOe_a#d#sp(rnotthor>e(a)d.sr)u,n (t&indcIcnlBSlhomcekm(.twhorreka)d;I d\x . x| ) ^, group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o15u:p )note: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ti d563( | t i d ) ,s tnetphSriezaed(sn(cnctlhSrhemaedms.)c,o mtmi.dbIunfBflSoiczke(st[hNrCeCaLd_IPdRxO.TxO)_,S IgMrPoLuEp](/gNrCoCuLp_)S,T E P| S ^~~~~~~~~~~~~~~~~/ siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:o562f:(60T:) )note: field 'group' will be initialized after field 'stepSize'{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 | | group(group tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r641e:a11d:s (note: nin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads )641, | t i d I n B l o c kp(rtihmrse(atdiIdd-xt.ixd)S,t agrrtoRuepd(ugcreo,u pn)T,h r e| a ^~~~~~~~~~~d sReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 563| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) st e563p | S i z e (sntcecplSSihzmee(mn.cccolmSmh.mbeumf.fcSoimzme.sb[uNfCfCSLi_zPeRsO[TNOC_CSLI_MPPRLOET]O/_NSCICMLP_LSET]E/PNSC/CsLi_zSeToEfP(ST/)s)i z{e o f| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T ) )| group(group{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :626:9: 655note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | p r i m s ( tpirdi-mtsi(dtSitda-rttiRdeSdtuacret,S cnaTthtreera,d snRTehdruecaed,s Sncualtltpetrr,, N&UdLiLr,e cdti-r>eocutt-,> uapr,g sa-r>gsse-n>dsbeunfdfb,u fafr,g sa-r>grse-c>vrbeucfvfb,u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202::20253::53 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R uRnuWnoWrokrEklEelmeemnetnt<>(())..rruunn((wwee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp::1111::11:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 1111 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, PPrreeMMuullSSuumm,, ffllooaatt)) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391 | 391 | R uRnuWnoWrokre,> ,N CNCCLC_LA_LAGLOG_O#_##a#laglog,o ,N CNCCLC_LP_RPORTOOT_O#_##p#rportoot>o(>)(.)r.urnu(n&(n&cncclcSlhSmhemme.mw.owrokr)k;) ;\ \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15::562 :note: 15field 'nthreads' will be initialized after field 'tidInBlock': note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562:15: :warning: 562initializer order does not match the declaration order [-Wreorder-ctor]: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid( t562i | d ) , tid(tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563s | t e p S isztee(pnSciczleS(hnmcecml.Schommemm..bcuofmfmS.ibzuefsf[SNiCzCeLs_[PNRCOCTLO__PSRIOMTPOL_ES]I/MNPCLCEL]_/SNTCECPLS_/SsTiEzPeSo/fs(iTz)e)o f{( T )| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ { | group(group| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11 :666 | note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here p687r | i m s ( t i d , n Tphrriemasd(stGiadt-hteird,S tdairrteBccta-s>tu,p ,n TNhUrLeLa,d saBrcgass-t>,s e&nddibruefcft,- >aorugts,- >nruelclvpbturf,f ,a r g| s ^- >sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:,53 :a rnote: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ->re c202v | b u f f , | ^R unWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:e202m:e53n:t , 2, 2>::run' requested heren , T, 202R | e d O p , A l gRou,n WPorroktEol>e(m)e.nrtu, 2, 2>::run' requested heret o>() .10r | uInM(PwLe_)C;O L L| _ ^F UNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppl:R12e:d1u:c enote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here COLL N12E | TI_MDPILR_ECCOTL,L _SFIUMNPCL(EA,l lPRreedMuucleS,u mC,O LhLaNlEfT)_ D I| R^E CT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :S391I:M95P:L Enote: ,expanded from macro 'IMPL_COLL_FUNC' PreMul S391u | m , RduonuWbolrek)< n c| c^l Func/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:#391f:u95n:c ,note: expanded from macro 'IMPL_COLL_FUNC't ype, F u391n | c # #RduenvWroerdkon,c #N#CfCuLn_cA,L GtOy_p#e#,a lFguon,c #N#CdCeLv_rPeRdOoTpO<_t#y#pper>o,t oN>C(C)L._rAuLnG(O&_n#c#callSghom,e mN.CwCoLr_k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s (n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: )field 'group' will be initialized after field 'stepSize', tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~~~~~~~d In/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c60k:( tnote: hfield 'group' will be initialized after field 'stepSize'r eadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid(tid), n t562h | r e a d tid(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), 563 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t epSiz e563( | n c c l SshtmeepmS.iczoem(mn.cbculfSfhSmiezme.sc[oNmCmC.Lb_uPfRfOSTiOz_eSsI[MNPCLCEL]_/PNRCOCTLO__SSTIEMPPSL/Es]i/zNeCoCfL(_TS)T)E P{S / s| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e o| f group(group( T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 666| : group(group9 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 687 : 11p:r inote: min instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (tid, n T687h | r e a d s G a t h e rp,r idmisr(etcitd-->tuipd,S tNaUrLtLB,c aasrtg,s -n>TshernedabdusfBfc,a satr,g s&-d>irreeccvtb-u>fofu,t , | n ^u llptr,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a202r:g53s:- >note: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ndbu f202f | , a r g s - > rReucnvWbourfkfE,l e m| e ^n t, 2, 2>::run' requested here Algo ,202 | P r o t o > ( ) .RruunnW(owrek)E;l e m| e ^n t, 2, 2>::run' requested herep , Al g12o | ,I MPPrLo_tCoO>L(L)_.FrUuNnC((wAel)l;R e d| u ^c e, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppL:N12E:T1_:D Inote: Rin instantiation of member function 'RunWork, 2, 2>::run' requested hereE CT, S12I | MIPMLPEL,_ CPOrLeLM_uFlUSNuCm(,A ldloRuebdluec)e , | C^O LLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:_391D:I95R:E Cnote: Texpanded from macro 'IMPL_COLL_FUNC', SIMPL E391, | P rReuMnuWloSrukm<,n cdcoluFbulnec)# # f| u^n c, t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hy:p391e:,95 :F unote: nexpanded from macro 'IMPL_COLL_FUNC'c ##devre d391o | p < tRyupneW>o,r kNp(<)t.yrpuen>(,& nNcCcClLS_hAmLeGmO._w#o#rakl)g;o ,\ N C| C ^L _PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:T562O:_15#:# pnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o to>() .562r | u n ( & ntcicdl(Sthimde)m,. wnotrhkr)e;a d\s ( n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562:15 :t inote: dfield 'nthreads' will be initialized after field 'tidInBlock'I nBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I60n:B lnote: ofield 'group' will be initialized after field 'stepSize'c k(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562):,60 :t inote: dfield 'group' will be initialized after field 'stepSize'I nBloc k562( | t h r e atdiIdd(xt.ix)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~d s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562S:t15a:r twarning: Binitializer order does not match the declaration order [-Wreorder-ctor]c ast, nThr e562a | d s B c atsitd,( t&iddi)r,e cntt-h>roeuatd,s (nnutlhlrpetard,s )a,r gtsi-d>IsneBnldobcukf(ft,h raeragdsI-d>xr.exc)v,b ugfrfo,u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~202 : 53| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 563 | 202 | s t e p S i zReu(nnWcocrlkSEhlmeemme.nctoM(P)L.Er]u/nN(CwCeL)_;S T E| P ^S /sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppf:(9T:)1): {note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 9 group(group | IMPL_COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:F641U:N11C:( Anote: lin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel Reduc e641, | C O L L N E T _ D I RpErCiTm,s (StIiMdP-LtEi,d SPtraerMtuRleSduumc,e ,u innTth6r4e_atd)s R e| d^u ce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:i391r:e95c:t -note: >expanded from macro 'IMPL_COLL_FUNC'd own, & d391i | r e cRtu-n>Woourtk,< nacrcglsF-u>nsce#n#dfbuunfcf,, tayrpges,- >Fruenccv#b#udfefv,r e d| o ^p , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202C:L53_:A Lnote: Gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereO _##a l202g | o , N C C L _ PRRuOnTWOo_r#k#Eplreomteon>t(<)F.nr,u nT(,& nRcecdlOSph,m eAml.gwoo,r kP)r;o t\o > (| ) ^. run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:)562;: 15 :| ^note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :56212 | : 1 : note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herei d(ti d12) | ,I MnPtLh_rCeOaLdLs_(FnUtNhCr(eAaldlsR)e,d utcied,I nCBOlLoLcNkE(Tt_hDrIeRaEdCITd,x .SxI)M,P LgEr,o uPpr(egMruoluSpu)m,, d| o ^~~~~~~~~~~~~~~~~u bl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:)562 : 60| :^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :56295 | : note: expanded from macro 'IMPL_COLL_FUNC' tid(ti d391) | , nRtuhnrWeoardks<(nnctchlrFeuandcs#)#,f utnicd,I ntBylpoe,c kF(utnhcr#e#addeIvdrxe.dxo)p,< tgyrpoeu>p,( gNrCoCuLp_)A,L G O| _ ^~~~~~~~~~~# #algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562):,15 :t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]I nBlock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~z es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,warning: initializer order does not match the declaration order [-Wreorder-ctor]N CCL_ALGO_##algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrcecaldSsh)m,e mt.iwdoIrnkB)l;o c\k ( t| h ^r eadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(gro u562p | ) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d), n t563h | r e a d ss(tnetphSriezaed(sn)c,c ltSihdmIenmB.lcoocmkm(.tbhurfefaSdiIzdexs.[xN)C,C Lg_rPoRuOpT(Og_rSoIuMpP)L,E ] /| N ^~~~~~~~~~~~~~~~~C CL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:E562P:S60/:s inote: zfield 'group' will be initialized after field 'stepSize'e of(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r655e:a11d:s )note: ,in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidInB l655o | c k ( t h r e a d I dpxr.ixm)s,( tgirdo-utpi(dgSrtoaurpt)R,e d u| c ^~~~~~~~~~~e , nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t isdt(etpiSdi)z,e (nntchcrleSahdmse(mn.tchormema.dbsu)f,f StiizdeIsn[BNlCoCcLk_(PtRhOrTeOa_dSIIdMxP.LxE)],/ NgCrCoLu_pS(TgErPoSu/ps)i,z e o| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( T )| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 563| | group(group stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:c666l:S9h:m enote: min instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. comm. b666u | f f S i z e s [ NpCrCiLm_sP(RtOiTdO,_ SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:,15 :P rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]M ulSum, half )562 | | ^ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t391i:d95):, note: nexpanded from macro 'IMPL_COLL_FUNC't hread s391( | n t hRruenaWdosr)k,< ntcicdlIFnuBnlco#c#kf(utnhcr,e atdyIpdex,. xF)u,n cg#r#oduepv(rgedroopu| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _ALG O563_ | # # a l gsot,e pNSCiCzLe_(PnRcOcTlOS_h#m#epmr.octoom>m(.)b.urfufnS(i&znecsc[lNSChCmLe_mP.RwOoTrOk_)S;I M\P L E| ] ^/ NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:P562S:/15s:i znote: efield 'nthreads' will be initialized after field 'tidInBlock'o f(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d677s:(11n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads) ,677 | t i d I n B l o c k (ptrhirmesa(dtIiddx-.txi)d,S tgarrotuBpc(agsrto,u pn)T,h r e| a ^~~~~~~~~~~~~~~~~d sBca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562,: 60&:d inote: rfield 'group' will be initialized after field 'stepSize'e ct->o u562t | , d i rteicdt(-t>iddo)w,n ,n tahrrgesa-d>ss(enntdhbruefafd,s )a,r gtsi-d>IrneBclvobcukf(ft,h r e| a ^d Idx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:( gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up), 202 | | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~~~~~~~ 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 :s tnote: efield 'group' will be initialized after field 'stepSize'p Size( n562c | c l S h mteimd.(ctoimdm).,b unftfhSriezaedss[(NnCtChLr_ePaRdOsT)O,_ StIiMdPILnEB]l/oNcCkC(Lt_hSrTeEaPdSI/dsxi.zxe)o,f (gTr)o)u p{( g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p )| , group(group | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hlReduc:e562,: 15C:O Lwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]N ET_DIRECT, SIM P562L | E , P rteiMdu(ltSiudm),, rnctchlr_ebafdlso(antt1h6r)e a d| s^) , tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B391l:o95c:k (note: texpanded from macro 'IMPL_COLL_FUNC'h readIdx .391x | ) , RgurnoWuopr(kg.,c oNmCmC.Lb_uAfLfGSOi_z#e#sa[lNgCoC,L _NPCRCOLT_OP_RSOITMOP_L#E#]p/rNCoCtLo_>S(T)E.PrSu/ns(i&znecocfl(STh)m)e m{. w o| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ) ;| group(group\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'687 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 687 | t i d ( t i d ) , nptrhirmesa(dtsi(dn-tthirdeSatdasr)t,B ctaisdtI,n BnlTohcrke(atdhsrBecaadsItd,x .&xd)i,r egcrto-u>po(ugtr,o unpu)l,l p t| r ^~~~~~~~~~~~~~~~~, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:s60e:n dnote: bfield 'group' will be initialized after field 'stepSize'u ff, ar g562s | - > r e ctvibdu(ftfi,d ) ,| ^n thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~o , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouhmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gropu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562_:F15U:N Cwarning: (initializer order does not match the declaration order [-Wreorder-ctor]A llReduc e562, | C O L LtNiEdT(_tDiIdR)E,C Tn,t hSrIeMaPdLsE(,n tPhrreeMaudlsS)u,m ,t iddoIunbBlleo)c k (| t^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup(group) ,391 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R u n| W tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rkC,C LN_CPCRLO_TAOL_GSOI_M#P#LaEl]g/oN,C CNLC_CSLT_EPPRSO/TsOi_z#e#opfr(oTt)o)> ({) . r| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n ( &| n group(groupc clShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hw:o626r:k9):; note: \in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : pnote: rfield 'nthreads' will be initialized after field 'tidInBlock'i ms(ti d562- | t i d S ttairdt(Stciadt)t,e rn,t hnrTehardesa(dnstShcraetatdesr),, NtUiLdLI,n Bdliorcekc(tt-h>ruepa,d Iadrxg.sx-)>,s egnrdobuupf(fg,r oaurpg)s,- > r| e ^~~~~~~~~~~~~~~~~c vbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 : 60| : ^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :562202 | : 53 : note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d(ti d202) | , n t h r e a dRsu(nnWtohrrkeEaldesm)e,n tto(u)p.(rgurno(uwpe)),; | | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdexpanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_Pmmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ >(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, P154r | o t o L Lc1a2s8e, 3f:u l l| O ^p s>(comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp,: 5a:l9g:o ,note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herew ork); 5\ | | ^ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ : warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ , 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), 5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :g154r:o10u:p (warning: gvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]r oup )154, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ c a| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e 3: | 563 ^ | st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cppe:p5S:i9z:e (note: nin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herec clSh m5e | m . c o m m . b uMfSfCSCiLz_eIsM[PNLC_CKLE_RPNREOLT_OE_NSTIRMYP_LFEU]N/CN_CDCELV_RSETDEOPPS_/TsYiPzEe(oSfu(mT,) )u i{n t 6| 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ t ,| group(groupf alse); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ^: 217:57: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here405 :3: note: 217expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | Primi t405i | v e smc,# #1d,e vPrreodtoop,< t0y>p ep>r,i mPsr o t| o ^S imple, ProtoSimple<2, 2>, false>' requested hereU NKSTE P5S | / M S C C L _ S LMISCCECSLT_EIPMSP,L _MSCCL_SLICESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp +In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, byt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hes:)154;: 10 :| warning: ^~~variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h405::1343::14 :note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'note: initialize the variable 'dst' to silence this warning 134 | 405 | mvsocicdl R*udnsItn,t e*rsprrce;t e r| < ^t y p| e = nullptr, Func##devredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: In file included from warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cppvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]: 1: In file included from 162/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 13 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:e167f: a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:l562t::15 : | warning: ^~~~~~~initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 562uninitialized use occurs here | t165i | d ( t i dc)o,p ynTtohSrhemaedms8((nttihdr%eWaAdRsP)_,S ItZiEd,I ndBslto,c ks(rtch,r ebaydtIedsx).;x ) ,| ^~~g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | :33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter13,: In file included from P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hr:o167t: o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562m:p15l:e B,l ofcukl(ltOhprse>a(dcIodmxm.,x )a,l ggor,o uwpo(rgkr)o;u p\) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 165:33: note: 563uninitialized use occurs here | s165t | e p S i zceo(pnycTcolSShhmmeemm8.(ctoimdm%.WbAuRfPf_SSiIzZeEs,[ NdCsCtL,_ PsRrOcT,O _bSyItMePsL)E;] / N| C ^~~C L_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:S162/:s5i:z ewarning: ovariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]f (T) )162 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d e f| a group(groupu lt: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::217165::5733:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereuninitialized use occurs here 217165 | | P r icmoiptyiTvoeSsh,, b1y,t ePsr)o;t o ,| ^~~0 > prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fuIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr llOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ SCCL_SLICESTEPS, MSCCL_SLICESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreterIn file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp :P1r: o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:o154S:i10m:p lwarning: evariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]< MSCC L154_ | C H U N KcSaTsEeP S3/:M S C| C ^L _SLICESTEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cppS:,5 :M9S:C Cnote: Lin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here_ SLIC E5S | T E P S > , f uMlSlCOCpLs_>I(McPoLm_mK,E RaNlEgLo_,E NwToRrYk_)F;U N\C _ D| E ^V REDOP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:T562Y:P15E:( Snote: ufield 'nthreads' will be initialized after field 'tidInBlock'm , ui n562t | 3 2 _ t ,t ifda(ltsied));, n| t ^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:(399n:t3h:r enote: aexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd s), tid I399n | B l omcskc(ctlhRruenaIdnItdexr.pxr)e,t egrrfield 'group' will be initialized after field 'stepSize', Pro t562o | L L , ftuildl(Otpisd>)(,c onmtmh,r eaaldgso(,n twhorreka)d;s )\, t| i ^d InBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:e165a:d33I:d xnote: .uninitialized use occurs herex ), gr o165u | p ( g r ocuopp)y,T o S| h ^~~~~~~~~~~m em8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr , src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: 217In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h: 169 : P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hr:i509m:i29t:i vwarning: efield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]s ,n t1h,r ePardost(on,t h0r>e apdrsi)m,s w i| d ^( tid%WARP_SIZE), w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cppa:r5p:(9t:i dnote: /in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereW ARP_SIZ E5) | , | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)M SCC L508_ | I M P L _wKaErRpNIEnLB_lEoNcTkR(Yt_hFrUeNaCd_IDdExV.RxE/DWOAPR_PT_YSPIEZ(ES)u,m , | u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i n t| 3 warp(tid/WARP_SIZE2 _t, f509a | l s e ) ;f l a| g ^T hread(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t405i:d3%:4 )note: =expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'= 3), gro u405p | ( g rmosucpc)l,R u n| I ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~n t e| r warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3p rete r510< | t y p e ,s tFeupnSci#z#ed(envcrceldSohpmm,m .PbruoftfoSSiizmepsl[eN ,| group(groupf ullOps>(comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 217a:l57g:o ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herew ork) ;217 | \ Primitives < T| , ^ RedOp, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hF:a562n:A15s:y mnote: mfield 'nthreads' will be initialized after field 'tidInBlock'e tric<1 ,5621 | > , 1 ,t iPdr(ottiod,) ,0 >n tphrriemasd s (| n ^t hreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp):,5 :t9i:d Inote: nin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereB lock (5t | h r e a d I d x .MxS)C,C Lg_rIoMuPpL(_gKrEoRuNpE)L,_ E N| T ^~~~~~~~~~~~~~~~~R Y_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562_:D60E:V Rnote: Efield 'group' will be initialized after field 'stepSize'D OP_TY P562E | ( S u m ,t iudi(ntti3d2)_,t ,n tfharlesaed)s;( n t| h ^r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)402,: 3t:i dnote: Iexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'n Bloc k402( | t h rmesacdcIldRxu.nxI)n,t egrrporuept(egrr, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWorIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ dPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), Pnthreads(rnotthorLeLa,d sf)u,l lwOipds(>t(icdo%mWmA,R Pa_lSgIoZ,E )w,o rwka)r;p (\t i d| / ^W ARP_SIZE),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165| : ~~~~~~~~~~~~~~~~~~33 : | note: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)uninitialized use occurs here 508 | 165 | w a r pcIonpBylToocSkh(mtehmr8e(atdiIdd%xW.AxR/PW_ASRIPZ_ES,I ZdEs)t,, s| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c , | b warp(tid/WARP_SIZEy tes) ;509 | | ^~~ flagThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:a162d:(5(:t iwarning: dvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]% 4)= =1623 | ) , g rdoeufpa(uglrto:u p )| , ^~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~: 165 :| 33 warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3: note: uninitialized use occurs here 510 | 165 | s t e pcSoipzyeT(onSchcmleSmh8m(etmi.dc%oWmAmR.Pb_uSfIfZSEi,z edss[tN,C CsLr_cP,R ObTyOt_eLsL)1;2 8 ]| / ^~~N CCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h5: | 134 : 14 : note: initialize the variable 'dst' to silence this warning MS C134C | L _ I M PvLo_iKdE R*NdEsLt_,E N*TsRrYc_;F U N| C ^_ D E| V = nullptrR EDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; In file included from | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp :| 1 = nullptr: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hO:f386f:s9e:t ;warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] | ^~~ 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, ds warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ t, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyTo 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ Shmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthrIn file included from eads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp :t1i: dI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:B154l:o10c:k (warning: tvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]h read I154d | x . x ) ,c agsreo u3p:( g r| o ^u p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5: 9562: | note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here tid (5t | i d ) , n t h rMeSaCdCsL(_nItMhPrLe_aKdEsR)N,E Lt_iEdNITnRBYl_oFcUkN(Ct_hDrEeVaRdEIDdOxP._xT)Y,P Eg(rSouump,( gfrloouapt),, f a| l ^~~~~~~~~~~s e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: In file included from note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cppin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here: 1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :5154 | : 10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] MSCCL_ I154M | P L _ K EcRaNsEeL _3E:N T R| Y ^_ FUNC_DEVREDOP_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cppT:Y5P:E9(:S unote: min instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here, flo a5t | , f a l s e ) ;M S C| C ^L _IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:K402E:R3N:E Lnote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'E NTRY_ F402U | N C _mDsEcVcRlERDuOnPI_nTtYePrEp(rSeutme,r :,3 :P rnote: oexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE't oLL12 8399, | f umlslcOcplsR>u(ncIonmtme,r parlegtoe,r , ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hrc, b:y154t:e10s:) ;warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] | ^~~ 154 | case 3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 162 :| 5 ^: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cppd:e5f:a9u:l tnote: : in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here | ^~~~~~~ 5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 165 : 33 : note: uninitialized use occurs here MSCC L165_ | I M P L _cKoEpRyNTEoLS_hEmNeTmR8Y(_tFiUdN%CW_ADREPV_RSEIDZOEP,_ TdYsPtE,( Ssurmc,, fblyotaets,) ;f a l| s ^~~e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :uninitialized use occurs here134 :14: note: 165initialize the variable 'dst' to silence this warning | 134c | o p y T ovSohimde m*8d(stti,d %*WsArRcP;_ S I| Z ^E , | d = nullptrs t, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hwarning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] :154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]154 | case 3 :154 | | ^ case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5 :59 | : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here M5S | C C L _ I M P L _MKSECRCNLE_LI_MEPNLT_RKYE_RFNUENLC__EDNETVRRYE_DFOUPN_CT_YDPEEV(RSEuDmO,P _fTlYoPaEt(,S ufma,l sfel)o;a t ,| ^f alse)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h;: 402 :| 3 ^: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402 :4023 | : note: mexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE's cclRu n402I | n t emrspcrceltReurnv,r ePdroopt8,, PfruoltloOLpLs1>2(8c,o mfmu,l laOlpgso>,( cwoomrmk,) ;a l\g o ,| ^w ork); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h\: 165 :| 33 ^: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165: 33165: | note: uninitialized use occurs here cop y165T | o S h m ecmo8p(ytTiodS%hWmAeRmP8_(StIiZdE%,W AdRsPt_,S IsZrEc,, dbsytt,e ss)r;c , | b ^~~y tes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]: 162:5 :162 | warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] d162e | f a u l td:e f a| u ^~~~~~~l t:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165| : ^~~~~~~33 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :uninitialized use occurs here165 :33: note: uninitialized use occurs here165 | 165c | o cpoypTyoTSohSmhemme8m(8t(itdi%dW%AWRAPR_PS_ISZIEZ,E ,d sdts,t ,s rscr,c ,b ybtyetse)s;) ; | ^~~| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h134: | 134 : 14 : vnote: oinitialize the variable 'dst' to silence this warningi d * d134s | t , * svroci;d *| d ^s t ,| = nullptr* src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:In file included from 13/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp: :In file included from 1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h: :In file included from 169/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h13:: 509In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h29::169 : warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]: 509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | t507i | d ( t i dt)i,d (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r ewaidds()t,i dw%iWdA(RtPi_dS%IWZAER)P,_ SwIZE), warpa(rtpi(dt/iWdA/RWPA_RSPI_ZSEI)Z,E ) ,| ~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 508 | w a rwpaIrnpBIlnoBclko(ctkh(rtehardeIaddxI.dxx/.WxA/RWPA_RSPI_ZSEI)Z,E ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| warp(tid/WARP_SIZE | warp(tid/WARP_SIZE 509 | 509 | f l a gfTlhargeTahdr(e(atdi(d(%t4i)d=%=43))=,= 3g)r,o ugpr(ogurpo(ugpr)o,u p )| , ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 510 | s t e p Ssitzeep(Sniczcel(SnhcmcelmS.hcmoemmm..cboumfmf.SbiuzfefsS[iNzCeCsL[_NPCRCOLT_OP_RLOLT1O2_8L]L/1N2C8C]L/_NSCTCELP_SS/TsEiPzSe/osfi(zueionft(6u4i_ntt)6)4 _{t ) )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~{ | | group(group ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 217note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here57 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | P217r | i m iPtriivmeist<,1 ,11,> ,P r1o,t oP,r o0t>o ,p r0i>m sp r i| m ^s | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp :5:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp :note: 5in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here: 9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here5 | 5 | M S C C L _MISMCPCLL__KIEMRPNLE_LK_EERNNTERY_FLU_NECN_TDREYV_RFEUDNOCP__DTEYVPREE(DSOuPm_,T YfPlEo(aStu,m ,f afllsoea)t;, f| a ^l se)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h;: 402 :| 3 ^: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402: 3402: | note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'm scclRu n402I | n t emrspcrceltReurne,v rPerdootpo,, fPurloltOopLsL>1(2c8o,m mf,u lallOgpos,> (wcoormkm),; a\l g o| , ^ work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:N562G:,15 :S Iwarning: Minitializer order does not match the declaration order [-Wreorder-ctor]P LE, Sum P562o | s t D i vt,i di(ntti3d2)_,t )n t h| r^e ads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC') , tidIn B391l | o c kR(utnhWroerakde,p SNiCzCeL(_nAcLcGlOS_h#m#eaml.gcoo,m mN.CbCuLf_fPSRiOzTeOs_[#N#CpCrLo_tPoR>O(T)O._rSuInM(P&LnEc]c/lNSChCmLe_mS.TwEoPrSk/)s;i z\e o f| ( ^T )) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 15 :| group(groupnote: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h562: | 33 : 7 : tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hered (tid), 33n | t h r e a d sp(rnitmhsr(etaidds,) ,n tthirdeIandBsl,o c&kr(itnhgr-e>apdrIedvx,. x&)r,i nggr-o>unpe(xgtr,o uapr)g,s - >| s ^~~~~~~~~~~~~~~~~e ndb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:,60 :a rnote: gfield 'group' will be initialized after field 'stepSize's ->rec v562b | u f f , tairdg(st-i>dr)e,d OnptAhrrge,a d0s,( natrhgrse-a>dcso)n,n ItniddeIxn,B laorcgks(-t>hcroenandIInddxe.xx));, g| r ^o up(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hg:r78o:u5p:) ,note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here | ^~~~~~~~~~~ 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h::514386::99:: warning: warning: variable 'offset' set but not used [-Wunused-but-set-variable]variable 'wireOffset' set but not used [-Wunused-but-set-variable] 514386 | | iinntt owfifrseeOtf f=s etti d=; W i| r ^e WordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h165::15433::10 :note: uninitialized use occurs herewarning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 165 | 154 | c o p ycTaosSeh m3e:m 8 (| t ^i d%WARP_SIZE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp :d5s:t9,: snote: rin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested herec , byt e5s | ) ; | ^~~ MSC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:L162_:I5M:P Lwarning: _variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]K ERN E162L | _ E N T RdYe_fFaUuNlCt_:D E V| R ^~~~~~~E DO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:_165T:Y33P:E (note: Suninitialized use occurs hereu m, r165c | c l _ b fcloopaytT1o6S,h mfeaml8s(et)i;d % W| A ^R P_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hZ:E399,: 3d:s tnote: ,expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' src, b y399t | e s )m;s c c| l ^~~R unInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :d134s:t14,: snote: rinitialize the variable 'dst' to silence this warningc , b y134t | e s ) ; v o| i ^~~d *dst,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :*162s:r5c:; warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]| ^ | 162 = nullptr | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 134 : 14t:i dnote: (initialize the variable 'dst' to silence this warningt id) ,134 | n t h r evaodisd( n*tdhsrte,a d*ss)r,c ;t i d| I ^n B l| o = nullptrc k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: In file included from in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp :1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 1545: | 10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] M S154C | C L _ I McPaLs_eK E3R:N E L| _ ^E NTRY_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp_:D5E:V9R:E Dnote: Oin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested hereP _TYP E5( | S u m , r c c lM_SbCfClLo_aItM1P6L,_ KfEaRlNsEeL)_;E N T| R ^Y _FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:_405D:E3V:R Enote: Dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'O P_TYPE(S u405m | , rmcscclc_lbRfulnoIantt1e6r,p rfeatlesre<)t;y p e| , ^ Fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:#399#:d3e:v rnote: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd op , mPsrcoctloRSuinmIpnltee,, MPSrCoCtLo_LSLL,I CfEuSlTlEOPpSs>>,( cfoumlml,O pasl>g(oc,o mwmo,r ka)l;g o\, w| o ^r k); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ^: 165:33:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :note: 165uninitialized use occurs here: 33: note: uninitialized use occurs here 165 | 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hhrea:d154I:d10x:. xwarning: )variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized], group( g154r | o u casep )3,: | | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, dataIn file included from 2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp,: 1f: lIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hg:213;: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h| : ^~~~~169 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h271::15319::21 :warning: unused variable 'ptr' [-Wunused-variable]warning: unused variable 'flag1' [-Wunused-variable] 153 | 271 | u i n t 3 2u_itn td6a4t_at1*, pftlra g=1 ,r edcavtPat2r,( 0f)l+algl21;2 8 O| f ^~~~~f se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ht:;153 : 28| : ^~~ warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work);In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 154| : ^10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :165: 33154: | note: uninitialized use occurs here case 1653 | : | ^c opyToS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpph:m5e:m98:( tnote: iin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hered %WAR P5_ | S I Z E , d s tM,S CsCrLc_,I MbPyLt_eKsE)R;N E L| _ ^~~E NTRY_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hU:N162C:_5D:E Vwarning: Rvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]E DOP _162T | Y P E ( Pdreofda,u litn:t 3 2| _ ^~~~~~~t , f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:l165s:e33):; note: uninitialized use occurs here| ^ 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 405 : 3c:o pnote: yexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'T oShmem8 (405t | i d %mWsAcRcPl_RSuInZIEn,t edrsptr,e tserrc<,t ybpyet,e sF)u;n c #| # ^~~d evredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:o134p:y14T:o Snote: hinitialize the variable 'dst' to silence this warningm em8( t134i | d % W A RvPo_iSdI Z*Ed,s td,st, sr c, b*ystrecs;) ; | ^| ^~~ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::169134: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h14::509 :note: 29initialize the variable 'dst' to silence this warning: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 134 | vo i507d | * d s tt,i d*(stricd;) , | n ^t h r| e = nullptra ds(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.coIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cppm:.1b: uIn file included from f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:S13i: zIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hs:[167N: C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Twarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ LL128]/N C562C | L _ S T EtPiSd/(stiizde)o,f (nutihnrte6a4d_st()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hh:r217e:a57d:I note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here d x217. | x ) ,P rgirmiotuipv(esgm,e m1.,c oPmrmo.tbou,f f0S>i zpersi[mNsC C L| _ ^P ROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cppI:M5P:L9E:] /note: Nin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereC CL_S T5E | P S / s i z e o fM(STC)C)L _{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ K E| R group(groupN EL_ENTRY/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:F217U:N57C:_ Dnote: Ein instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereV REDO P217_ | T Y PPEr(iPmriotdi,v eisn ,note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'1 , Pro t402o | , 0m>s cpcrliRmusn I n| t ^e rpret/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cppe:r5<:t9y:p enote: ,in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here Func# #5d | e v r e d o p < tMySpCeC>L,_ IPMrPoLt_oKLELR1N2E8L,_ EfNuTlRlYO_pFsU>N(Cc_oDmEmV,R EaDlOgPo_,T YwPoEr(kP)r;o d\, i| n ^t 32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(thIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^readIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :405154 | : 10 :m swarning: cvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]c lRunI n154t | e r p r ectaesre< t3y:p e ,| ^F unc##devre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cppd:o5p:<9t:y pnote: ein instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here> , Pro t5o | S i m p l e < M SMCSCCLC_LC_HIUMNPKLS_TKEEPRSN/EMLS_CECNLT_RSYL_IFCUENSCT_EDPESV,R EMDSOCPC_LT_YSPLEI(CPErSoTdE,P Si>n,t 3f2u_ltl,O pfsa>l(sceo)m;m , | a ^l go, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hw:o402r:k3):; note: \expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | ^ 402 | ms/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:c165l:R33u:n Inote: nuninitialized use occurs heret erpret e165r | < t y p ec,o pFyuTnocS#h#mdeemv8r(etdiodp%S,I ZPEr,o tdosLtL,1 2s8r,c ,f ublyltOepss)>;( c o| m ^~~m , algo, wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hk:)162;: 5\: warning: | variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h162: | 165 : 33 : dnote: euninitialized use occurs heref ault :165 | | ^~~~~~~ co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:y165T:o33S:h mnote: euninitialized use occurs herem 8(ti d165% | W A R P _cSoIpZyET,o Sdhsmte,m 8s(rtci,d %bWyAtRePs_)S;I Z E| , ^~~ dst,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :s162r:c5,: bwarning: yvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]t es )162; | | ^~~ default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp::11: : In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::1313: : In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h::169167: : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::509562::2915:: warning: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 507 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,w itdi(dtIindB%lWoAcRkP(_tShIrZeEa)d,I dwxa.rxp)(,t igdr/oWuApR(Pg_rSoIuZpE)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 563508 | | swtaerppSIinzBel(oncckc(ltShhrmeeamd.Icdoxm.mx./bWuAfRfPS_iSzIeZsE[)N,C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P R O| T warp(tid/WARP_SIZEO _SIM P509L | E ] / N CfClLa_gSTThErPeSa/ds(i(zteiodf%(4T))=)= 3{) , | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :| 217 ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~: 57 :| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 510217 | | P rsitmeiptSiivzees(L,_ P1R,O TPOr_oLtLo1,2 80]>/ NpCrCiLm_sS T E| P ^S /sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cppf:(5u:i9n:t 6note: 4in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here_ t)) {5 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group MSCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hI:M217P:L57_:K Enote: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereN EL_E N217T | R Y _PFrUiNmCi_tDiEvVeRsEe,) ;1 , | P ^r oto,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :0405>: 3p:r inote: mexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE's | ^ 405 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp : 5m:s9c:c lnote: Rin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereu nInt e5r | p r e t e r < t yMpSeC,C LF_uInMcP#L#_dKeEvRrNeEdLo_pEF,U NPCr_oDtEoVSRiEmDpOlPe_ , mfsuclcllORpusn>I(nctoemrmp,r eatlegro<,t ywpoer,k )F;u n\c # #| d ^e vred/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:p562<:t15y:p enote: >field 'nthreads' will be initialized after field 'tidInBlock', Prot o562L | L 1 2 8 ,t ifdu(ltliOdp)s,> (nctohmrme,a dasl(gnot,h rweoardks));, \t i d| I ^n Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KER/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hN:E154L:_10E:N Twarning: Rvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]Y _FU N154C | _ D E V RcEaDsOeP _3T:Y P E| ( ^P rod, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cppu:i5n:t93:2 _note: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here, fal s5e | ) ; | ^ MSCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hI:M405P:L3_:K Enote: Rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'N EL_ENTRY_ F405U | N C _mDsEcVcRlERDuOnPI_nTtYePrEp(rPertoedr,< tuyipnet,3 2F_utn,c #f#adlesver)e;d o p| < ^t ype/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h>:,402 :P3r:o tnote: oexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'S impl e402< | M S CmCsLc_cClHRUuNnKISnTtEePrSp/rMeStCeCrL<_tSyLpIeC,E SFTuEnPcS#,# dMeSvCrCeLd_oSpLE,P SP>r,o tfouLlLl1O2p8s,> (fcuolmlmO,p sa>l(gcoo,m mw,o rakl)g;o ,\ w o| r ^k ); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::562165::1533:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'uninitialized use occurs here 165 | 562 | c otpiydT(otSihdm)e,m 8n(tthirde%aWdAsR(Pn_tShIrZeEa,d sd)s,t ,t isdrIcn,B lboyctke(st)h;r e a| d ^~~I dx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hx:)162,: 5g:r owarning: uvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]p (g r162o | u p ) , d e| f ^~~~~~~~~~~~~~~~~a ult: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~: 60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 165note: :field 'group' will be initialized after field 'stepSize'33 : note: uninitialized use occurs here 562 | 165 | t icdo(ptyiTdo)S,h menmt8h(rteiadd%sW(AnRtPh_rSeIaZdEs,) ,d stti,d IsnrBcl,o cbky(ttehsr)e;a d I| d ^~~x .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 154 : 10w:a rwarning: pvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]I nBlo c154k | ( t h r ecaadsIed x3.:x / W| A ^R P_SIZE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp):,5 : 9| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here| warp(tid/WARP_SIZE 5509 | | f l a gMTShCrCeLa_dI(M(PtLi_dK%E4R)N=E=L3_)E,N TgRrYo_uFpU(NgCr_oDuEpV)R,E D O| P ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~_ T Y| P warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3E (Prod ,510 | u i n t 3s2t_etp,S ifzael(snec)c;l S h| m ^e m.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:m405.:b3u:f fnote: Sexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'i zes[NCC L405_ | P R OmTsOc_cLlLR1u2n8I]n/tNeCrCpLr_eStTeErP ,| group(groupP rotoSim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:l217e:<57M:S Cnote: Cin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereL _CHU N217K | S T EPPrSi/mMiStCiCvLe_sSc,< 1f,u1l>l,O p1s,> (Pcroomtmo,, a0l>g op,r iwmosr k )| ; ^ \ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp :5:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h9::165 :note: 33in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here: note: uninitialized use occurs here 5 | 165 | cMoSpCyCTLo_SIhMmPeLm_8K(EtRiNdE%LW_AERNPT_RSYI_ZFEU,N Cd_sDtE,V RsErDcO,P _bTyYtPeEs()P;r o d| , ^~~ uint3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h2:_162t:,5 :f awarning: lvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]s e); 162 | | ^ d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:f402a:u3l:t :note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h402: | 165 : 33m:s cnote: cuninitialized use occurs herel RunI n165t | e r p r ectoepryt,, Psrroct,o LbLy1t2e8s,) ;f u l| l ^~~O ps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRuwork); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr nInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp : 1 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 154 :M10S:C Cwarning: Lvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]_ IMPL_ K154E | R N E L _cEaNsTeR Y3_:F U N| C ^_ DEVREDOP_TYPE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp(:P5r:o9d:, note: iin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested heren t8_t, 5f | a l s e ) ; | M ^S CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hI:M405P:L3_:K Enote: Rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'N EL_ENT R405Y | _ F UmNsCc_cDlERVuRnEIDnOtPe_rTpYrPeEt(ePrr,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :P399r:o3t:o Snote: iexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'm ple, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr L_SLICESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cppm:81(: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:d154%:W10A:R Pwarning: _variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]S IZE, d154s | t , s rcca,s eb y3t:e s )| ; ^ | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp162::55::9 :warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 162 | 5 | d e f a u l t : M S| C ^~~~~~~C L_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hI:M165P:L33_:K Enote: Runinitialized use occurs hereN EL_E N165T | R Y _ F UcNoCp_yDTEoVSRhEmDeOmP8_(TtYiPdE%(WPArRoPd_,S IuZiEn,t 6d4s_tt,, sfracl,s eb)y;t e s| ) ^; | ^~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :s134r:c14,: bnote: yinitialize the variable 'dst' to silence this warningt es) ;134 | | ^~~ void/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :*162d:s5t:, warning: *variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]s rc; 162 | | ^ | d = nullptre fault: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :562134 | : 14 : note: tinitialize the variable 'dst' to silence this warningi d(tid )134, | n t h rveoaidds (*ndtshtr,e a*dssr)c,; t i| d ^I n B| l = nullptro ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple , fullOps>(comm, alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cppo:,5 :w9o:r knote: )in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here; \ | ^5 | MS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:C165L:_33I:M Pnote: Luninitialized use occurs here_ KERNEL_ E165N | T R Y _ FcUoNpCy_TDoESVhRmEeDmO8P(_tTiYdP%EW(APRrPo_dS,I ZfEl,o adts,t ,f aslrsce,) ;b y t| e ^s ); | ^~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :399:3: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE': 162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 399 | 162 | m s c c ldReufnaIunltte:r p r| e ^~~~~~~t er/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h<:t165y:p33e:, note: Funinitialized use occurs hereu nc# #165d | e v r e dcooppm,e mP8r(ottiodL%LW,A RfPu_lSlIOZpEs,> (dcsotm,m ,s racl,g ob,y tweosr)k;) ; | \ ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyTo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hS:h134m:e14m:8 (note: tinitialize the variable 'dst' to silence this warningi d%W A134RP | _ S I Z Ev,o idds t*,d sstr,c ,* sbrycte;s ) ;| ^ | ^~~| = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ C_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flagIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunIntIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr erpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, In file included from h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cppa:l1f: ,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:a13l: sIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h):;167 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :warning: 405initializer order does not match the declaration order [-Wreorder-ctor]: 3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 562 | 405 | t i dm(stcicdl)R,u nnItnhtreerapdrse(tnetrhI,d xP.rxo)t,o Sgirmopulpe(u,f ffSuilzleOsp[sN>C(CcLo_mPmR,O TaOl_gSoI,M PwLoEr]k/)N;C C\L _ S| T ^E PS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:i165z:e33o:f (note: Tuninitialized use occurs here) ) { 165| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupc opyToSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:e217m:857(:t inote: din instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here% WARP _217S | I Z EP,r idmsitt,i vsersc<,T ,b yRteedsO)p;, F| a ^~~n Asymm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:t162r:i5c:< 1warning: ,variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]1 >, 1621 | , P r odteof,a u0l>t :p r i| m ^~~~~~~s /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ^165 :33: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cppnote: :uninitialized use occurs here5 :9: 165note: | in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here c o5p | y T o S h m e m 8M(StCiCdL%_WIAMRPPL__SKIEZREN,E Ld_sEtN,T RsYr_cF,U NbCy_tDeEsV)R;E D O| P ^~~_ TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :f134u:ll14O:p snote: >initialize the variable 'dst' to silence this warning( comm ,134 | a l g o ,v owiodr k*)d;s t\, *| s ^r c; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562 :| 15 = nullptr: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cppp:)1,: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ^~~~~~~~~~~13 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h):;514 :\9 : | warning: ^variable 'offset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :514165 | : 33 : note: iuninitialized use occurs heren t offs e165t | = t icdo;p y T| o ^S hmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp| : ^~~~~1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h::13153: :In file included from 35/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:: 168warning: : unused variable 'flag2' [-Wunused-variable]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :153:14 :153 | warning: unused variable 'data1' [-Wunused-variable] uint3 2153_ | t d a tuai1n,t 3f2l_atg 1d,a tdaa1t,a 2f,l afgl1a,g 2d;a t a| 2 ^~~~~, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:w134a:r14p: +note: initialize the variable 'dst' to silence this warning2 *wi d134; | | ^v oid *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp : 1u: iIn file included from n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:6134_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 154134: | 10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]v oid 154* | d s t , c*assrec ;3 : | ^| ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp :note: 1in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13 : 5In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 169 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h : 509 : 29M:S Cwarning: Cfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]L _IMPL_KER N507E | L _ E N TtRiYd_(FtUiNdC)_,D EnVtRhErDeOaPd_sT(YnPtEh(rPeraodds,) ,d owuibdl(et,i df%aWlAsReP)_;S I Z| E ^) , w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:r399p:(3t:i dnote: /expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'W ARP_S I399Z | E ) ,m s c| c ~~~~~~~~~~~~~~~~~~l R u| n stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)I nter p508r | e t e r R,P _PSrIoZtEo)L,L , | f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u l l| O warp(tid/WARP_SIZEp s>(c o509m | m , a lfgloa,g Twhorreka)d;( (\t i d| % ^4 )==3), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hg:r165o:u33p:( gnote: runinitialized use occurs hereo up), 165 | | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | c warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3o pyToS h510m | e m 8 ( tsitde%pWSAiRzPe_(SnIcZcEl,S hdmsetm,. csormcm,. bbuyftfeSsi)z;e s [| N ^~~C CL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hO:_162L:L51:2 8warning: ]variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]/ NC C162L | _ S T E PdSe/fsaiuzleto:f ( u| i ^~~~~~~n t6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h4:_165t:)33): {note: uninitialized use occurs here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165| | group(group copyT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:S217h:m57e:m 8note: (in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested heret id%W A217R | P _ SPIrZiEm,i tdisvte,s , 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :402134 | : 14 :m snote: cinitialize the variable 'dst' to silence this warningc lRu n134I | n t e r pvroeitde r*, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h10::154 :warning: 10variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 154 | c a scea s3e: 3 :| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp::55::99:: note: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herein instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 5 | M SMCSCCLC_LI_MIPMLP_LK_EKRENRENLE_LE_NETNRTYR_YF_UFNUCN_CD_EDVERVERDEODPO_PT_YTPYEP(EP(rPordo,d ,d oduobulbel,e ,f aflasles)e;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h405::4053::3 :note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 405 | m smcscclcRluRnuInnItnetreprrperteetre, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e>, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreabuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreadIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 391 | : 562R:u15n:W owarning: rinitializer order does not match the declaration order [-Wreorder-ctor]k e,a dNsC)C,L _tAiLdGIOn_B#l#oaclkg(ot,h rNeCaCdLI_dPxR.OxT)O,_ #g#rporuopt(og>r(o)u.pr)u,n ( &| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c c l| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h mem.w o563r | k ) ; \s t e| p ^S ize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S15h:m enote: mfield 'nthreads' will be initialized after field 'tidInBlock'. comm. b562u | f f S i zteisd[(NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt]h/rNeCaCdLs_)S,T EtPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hp:)33,: 7 :| ^~~~~~~~~~~~~~~~~note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60 :33 | note: field 'group' will be initialized after field 'stepSize' p r562i | m s ( t itdi,d (nttihdr)e,a dnst,h r&eraidnsg(-n>tphrreeva,d s&)r,i ntgi-d>InneBxlto,c ka(rtghsr-e>asdeInddxb.uxf)f,, garrogusp-(>grreocuvpb)u,f f ,| ^~~~~~~~~~~a rgs->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15A:L Gwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ ##algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrcecaldSsh)m,e mt.iwdoIrnkB)l;o c\k ( t| h ^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), n t563h | r e a d ss(tnetphSriezaed(sn)c,c ltSihdmIenmB.lcoocmkm(.tbhurfefaSdiIzdexs.[xN)C,C Lg_rPoup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connInde/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:)15;: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202562: | 53 : note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested heret id(t i202d | ) , n t h r e aRdusn(WnotrhkrEelaedmse)n,t r(o)u.pr(ugnr(owuep));, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp: 7563: | 1 : note: in instantiation of member function 'RunWork, 1, 2>::run' requested heres tepS i7z | eI(MnPcLc_lCSOhLmLe_mF.UcNoCm(mR.ebduufcfeSSiczaetst[eNrC,C LR_IPNRGO,T OS_ISMIPMLPEL,E ]S/uNmC,C Lu_iSnTtE3P2S_/ts)i z e| o^f (T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h{: 391 :| 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | expanded from macro 'IMPL_COLL_FUNC' group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h391: | 33 : 7R:u nnote: Win instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereo rkpper>e,v ,N C&CrLi_nAgL-G>On_e#x#ta,l gaor,g sN-C>CsLe_nPdRbOuTfOf_,# #aprrgost-o>>r(e)c.vrbuunf(f&,n cacrlgSsh-m>erme.dwOoprAkr)g;, \0 , args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'RunWork, 1, 2>::run' requested here562 :15: warning: 7initializer order does not match the declaration order [-Wreorder-ctor] | IMPL_COLL _562F | U N C(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s (nthr e563a | d s ) , sttiedpISniBzleo(cnkc(ctlhrShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCeadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562:15: :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 15: warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid(tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~r oup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ti d563( | t i d ) ,s tnetphSriezaed(sn(cnctlhSrhemaedms.)c,o mtmi.dbIunfBflSoiczke(st[hNrCeCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, In file included from 0/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp,: 1a: rIn file included from g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hs:-10>: cIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hn:n167I: n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:e562x:,15 :a rwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]s ->connIndex); 562 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ht:i78d:(5t:i dnote: )in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here, n t78h | r e a d sr(unntRhirnegat(harregasd)I;d x .| x ^) , grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~202 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) R563u | n W o r ksEtleepmSeinzte<(Fnnc,c lTS,h mReemd.Ocpo,m mA.lbguof,f SPirzoetso[>N(C)C.Lr_uPnR(OwTeO)_;S I M| P ^L E]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cppC:C5L:_1S:T Enote: Pin instantiation of member function 'RunWork, 1, 2>::run' requested hereS /si z5e | oIfM(PTL)_)C O{L L _| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~U N C| ( group(groupR educeScatter, RING, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hS:I33M:P7L:E ,note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereM ax, uint 833_ | t ) | ^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(391t:i95d:, note: nexpanded from macro 'IMPL_COLL_FUNC't hread s391, | & rRiunngW-o>rpkrfnuenxct,, tayrpges,- >Fsuenncd#b#udfefv,r eadrogps<-t>yrpeec>v,b uNfCfC,L _aArLgGsO-_>#r#eadlOgpoA,r gN,C C0L,_ PaRrOgTsO-_>#c#opnrnoItnod>e(x),. raurng(s&-n>cccolnSnhImnedme.xw)o;r k )| ; ^ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::5562:: 15note: :in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here note: field 'nthreads' will be initialized after field 'tidInBlock' 78 | 562 | r u n Rtiindg(t(harregasd)s;) , | t ^i dInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t202h:r53e:a dnote: Iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered x.x), 202g | r o u p ( g r o uRpu)n,W o r| k ^~~~~~~~~~~~~~~~~E le/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562n:t60<:F nnote: ,field 'group' will be initialized after field 'stepSize' T, R e562d | O p , Atligdo(,t iPdr)o,t on>t(h)r.eraudns((wnet)h;r e a| d ^s ), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cppl:o4c:k1(:t hnote: rin instantiation of member function 'RunWork, 1, 2>::run' requested heree adId x4. | xI)M,P Lg_rCoOuLpL(_gFrUoNuCp()R,e d u| c ^~~~~~~~~~~e Scatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pwarning: )initializer order does not match the declaration order [-Wreorder-ctor], | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmeIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: expanded from macro 'IMPL_COLL_FUNC' :562:15: warning: 391initializer order does not match the declaration order [-Wreorder-ctor] | RunWorkl,o cNkC(CtLh_rAeLaGdOI_d#x#.axl)g,o ,g rNoCuCpL(_gPrRoOuTpO)_,# # p| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o t o| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( ).r u563n | ( & n c csltSehpmSeimz.ew(onrckc)l;S h\m e m| . ^c omm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562s:[15N:C Cnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ PROTO_S I562M | P L E ] /tNiCdC(Lt_iSdT)E,P Sn/tshirzeeaodfs((Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | t group(groupi dInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hd:I33d:x7.:x )note: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here group (33g | r o u p ) , p r| i ^~~~~~~~~~~~~~~~~m s(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s, &ri n562g | - > p r etvi,d (&triidn)g,- >nntehxrte,a dasr(gnst-h>rseeanddsb)u,f ft,i daIrngBsl-o>crke(ctvhbruefafd,I daxr.gxs)-,> rgerdoOuppA(rggr,o u0p,) ,a r g| s ^~~~~~~~~~~- >connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m15e:m .warning: cinitializer order does not match the declaration order [-Wreorder-ctor]o mm.buffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hr:o33u:p7(:g rnote: oin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereu p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 33 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) p r563i | m s ( t isdt,e pnStihzree(andcsc,l S&hrmienmg.-c>opmrme.vb,u f&frSiinzge-s>[nNeCxCtL,_ PaRrOgTsO-_>SsIeMnPdLbEu]f/fN,C CaLr_gSsT-E>PrSe/csvibzueof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15R:u nwarning: Winitializer order does not match the declaration order [-Wreorder-ctor]o rkElemen t562< | F n , Tt,i dR(etdiOdp),, Anltghor,e aPdrso(tnot>h(r)e.ardusn)(,w et)i;d I n| B ^l ock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cppe:a8d:I1d:x .note: xin instantiation of member function 'RunWork, 1, 2>::run' requested here) , gr o8u | pI(MgPrLo_uCpO)L,L _ F| U ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C (| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e duceS c563a | t t e r ,s tReIpNSGi,z eS(InMcPcLlES,h mMeamx.,c oimnmt.6b4u_ftf)S i z| e^s [NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391P:R95O:T Onote: _expanded from macro 'IMPL_COLL_FUNC'S IMPLE] /391N | C C LR_uSnTWEoPrSk/, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herep ,33 | N C C L _ A LpGrOi_m#s#(atligdo,, nNtChCrLe_aPdRsO,T O&_r#i#npgr-o>tpor>e(v),. r&urni(n&gn-c>cnleSxhtm,e ma.rwgosr-k>)s;e n\d b u| f ^f , arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:r15e:c vnote: bfield 'nthreads' will be initialized after field 'tidInBlock'u ff, a r562g | s - > r etdiOdp(Atrigd,) ,0 ,n tahrrgesa-d>sc(onntnhIrnedaedxs,) ,a rtgisd-I>ncBolnoncIkn(dtehxr)e;a d I| d ^x .x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hg:r78o:u5p:( gnote: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereo up) ,78 | | ^~~~~~~~~~~~~~~~~ ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:R562i:n60g:< Tnote: ,field 'group' will be initialized after field 'stepSize' RedO p562, | P r o ttoi>d((argst)i;d ) ,| ^n threads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t202h:r53e:a dnote: sin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) , ti d202I | n B l o c k ( t hRruenaWdoIrdkxE.lxe)m,e ngtr().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:i562n:g15<:T ,warning: initializer order does not match the declaration order [-Wreorder-ctor]R edOp, Pr o562t | o > ( a rtgisd)(;t i d| ) ^, nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered s), t202i | d I n B l o c k (RtuhnrWeoardkIEdlxe.mxe)n,t ( )563. | r u n ( wset)e;p S i| z ^e (nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cppS:h9m:e1m:. cnote: oin instantiation of member function 'RunWork, 1, 2>::run' requested herem m.bu f9f | SIiMzPL_eCsO[LNLC_CFLU_NPCR(ORTeOd_uScIeMSPcLaEt]t/eNrC,C LR_ISNTGE,P SS/IsMiPzLeEo,f (MTa)x), {u i n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~6 4 _| t group(group) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::33391::795:: note: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereexpanded from macro 'IMPL_COLL_FUNC' 33391 | | R u n Wporrikmupnrce#v#,d e&vrriendgo-p> ,a rNgCsC-L>_sAeLnGdOb_u#f#fa,l gaor,g sN-C>CrLe_cPvRbOuTfOf_,# #aprrgost-o>>r(e)d.OrpuAnr(g&,n c0c,l Sahrmgesm-.>wcoornkn)I;n d\e x ,| ^a rgs->c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:n562n:I15n:d enote: xfield 'nthreads' will be initialized after field 'tidInBlock') ; | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :t78i:d5(:t inote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here) , nt h78r | e a d s (rnutnhRrienagdh(raeragdsI)d;x . x| ) ^, group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:) ,note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202: | 562 : 60 : note: field 'group' will be initialized after field 'stepSize' Run W562o | r k E l etmiedn(tti(d)I.nrBulno(cwke()t;h r e| a ^d Idx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cppg:r13o:u1p:( gnote: rin instantiation of member function 'RunWork, 1, 2>::run' requested hereo up), 13 | | I ^~~~~~~~~~~M PL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWorkE l562e | m e n t s()),. rtuind(IwneB)l;o c k| ( ^t hreadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cppx:.4x:)1,: gnote: rin instantiation of member function 'RunWork, 1, 2>::run' requested hereo up(g r4o | uIpM)P,L _ C| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L L _| F tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)U NC(R e563d | u c e S csattetpeSri,z eR(InNcGc,l SShImMePmL.Ec,o mMmi.nb,u fifnSti8z_ets)[ N C| C^L _PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_391S:I95M:P Lnote: Eexpanded from macro 'IMPL_COLL_FUNC'] /NCCL_ S391T | E P SR/usniWzoerokf<(nTc)c)l F{u n c| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~# f u| n group(groupc , type/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h,: 33F:u7n:c #note: #in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hered evredop <33t | y p e > , NpCrCiLm_sA(LtGiOd_,# #natlhgroe,a dNsC,C L&_rPiRnOgT-O>_p#r#epvr,o t&or>i(n)g.-r>unne(x&tn,c calrSghsm-e>ms.ewnodrbku)f;f ,\ a r| g ^s ->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hv:b562u:f15f:, note: afield 'nthreads' will be initialized after field 'tidInBlock'r gs->r e562d | O p A r gt,i d0(,t iadr)g,s -n>tchornenaIdnsd(enxt,h raeragdss-)>,c otnindIInndBelxo)c;k ( t| h ^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hx:.78x:)5,: gnote: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereo up( g78r | o u p ) ,r u n| R ^~~~~~~~~~~~~~~~~i ng562( | a r g s )t;i d (| t ^i d), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: (in instantiation of member function 'RunWorkElement, 1, 2>::run' requested heren thre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:391:95:: 562note: :expanded from macro 'IMPL_COLL_FUNC'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | RunWorki,d INnCBClLo_cAkL(GtOh_r#e#aadlIgdox,. xN)C,C Lg_rPoRuOpT(Og_r#o#uppr)o,t o >| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) . r| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n (&nccl S563h | m e m . wsotrekp)S;i z\e ( n| c ^c lShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:c562o:m15m:. bnote: ufield 'nthreads' will be initialized after field 'tidInBlock'f fSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ho:u33p:)7,: note: | in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 56233: | 60 : note: field 'group' will be initialized after field 'stepSize' prim s562( | t i d , tnitdh(rteiadd)s,, n&trhirnega-d>sp(rnetvh,r e&ardisn)g,- >tniedxItn,B laorcgks(-t>hsreenaddbIudfxf.,x )a,r ggsr-o>urpe(cgvrbouufpf),, a r| g ^~~~~~~~~~~s ->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d INnCBClLo_cAkL(GtOh_r#e#aadlIgdox,. xN)C,C Lg_rPoRuOpT(Og_r#o#uppr)o,t o >| ( ^~~~~~~~~~~) .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, Proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVLL128, fullOps>(comm, algo, work); \ | ^ REDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165h | r e a d sc(onptyhTroeSahdmse)m,8 (ttiiddI%nWBAlRoPc_kS(ItZhEr,e addsItd,x .sxr)c,, gbryotueps()g;r o u| p ^~~) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives | , 1 , vPoriodt o*,d s0t>, p*rsirmcs; | | ^ ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple | , f u ltliOdp(st>i(dc)o,m mn,t harlegaod,s (wnotrhkr)e;a d\s ) ,| ^w id(tid%/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hW:A562R:P15_:S Inote: Zfield 'nthreads' will be initialized after field 'tidInBlock'E ), war p562( | t i d / WtAiRdP(_tSiIdZ)E,) ,n t h| r ~~~~~~~~~~~~~~~~~~e a d| s stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)( nthr e508a | d s ) , wtairdpIInnBBlloocckk((tthhrreeaaddIIddxx..xx)/,W AgRrPo_uSpI(ZgEr)o,u p )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ warp(tid/WARP_SIZE /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562509: | 60 : note: field 'group' will be initialized after field 'stepSize'f lagT h562r | e a d ( (ttiidd(%t4i)d=)=,3 )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ t i| d warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3I nBlock (510t | h r e a dsItdexp.Sxi)z,e (gnrcoculpS(hgmreomu.pc)o,m m .| b ^~~~~~~~~~~u ffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppn:t1h: r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:a154d:s10(:n twarning: hvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]r eads), 154t | i d I n Bclaoscek (3t:h r e| a ^d Idx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppu:p5(:g9r:o unote: pin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here) , | ^~~~~~~~~~~~~~~~~ 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : note: Mfield 'group' will be initialized after field 'stepSize'S CCL_I M562P | L _ K E RtNiEdL(_tEiNdT)R,Y _nFtUhNrCe_aDdEsV(RnEtDhOrPe_aTdYsP)E,( Mtaixd,I niBnlto3c2k_(tt,h rfeaaldsIed)x;. x )| , ^ group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:g399r:o3u:p )note: ,expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | ^~~~~~~~~~~ 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | Int = nullptre rpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpph:r1e: aIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)13,: In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hi:d167I: n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~t hre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l o c| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( threa d563I | d x . x )s,t egprSoiuzpe((gnrcoculpS)h,m e m| . ^~~~~~~~~~~c omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32In file included from _t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp :d1a: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:1154,: 10f:l awarning: gvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]1 , data 2154, | f l a gc2a;s e | 3 ^~~~~: | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h ^: 153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp : 5 : 9u:i nnote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here3 2_t da t5a | 1 , f l a g 1 ,M SdCaCtLa_2I,M PfLl_aKgE2R;N E L| _ ^~~~~E NT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hR:Y153_:F28U:N Cwarning: _unused variable 'data2' [-Wunused-variable]D EVR E153D | O P _ T YuPiEn(tM3a2x_,t udiantta312,_ tf,l afga1l,s ed)a;t a 2| , ^ flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :134 :M14S:C Cnote: Linitialize the variable 'dst' to silence this warning_ IMP L134_ | K E R N EvLo_iEdN T*RdYs_tF,U N*Cs_rDcE;V R E| D ^O P _| T = nullptrY PE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:In file included from 33/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:: 1note: : uninitialized use occurs hereIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13 : 165In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 167 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:o562p:y15T:o Swarning: hinitializer order does not match the declaration order [-Wreorder-ctor]m em8(tid %562W | A R P _ StIiZdE(,t idds)t,, nstrhcr,e abdyst(enst)h;r e a| d ^~~s ), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:B162l:o5c:k (warning: tvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]h rea d162I | d x . x )d,e fgaruolutp:( g r| o ^~~~~~~u p),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~33 : | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)uninitialized use occurs here 563165 | | sctoeppySTiozSeh(mnecmc8l(Sthimde%mW.AcRoPm_mS.IbZuEf,f Sdiszte,s [sNrCcC,L _bPyRtOeTsO)_;S I M| P ^~~L E]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 134:14: note: initialize the variable 'dst' to silence this warning 134 | vMoSiCdC L*_dIsMtP,L _*KsErRcN;E L _| E ^N T R| Y = nullptr_ FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, alIn file included from g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cppo:,1 : wIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:k13): ;In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h\: 169 : | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h ^: 509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :507562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e awdisd((nttihdr%eWaAdRsP)_,S ItZiEd)I,n Bwlaorcpk((ttihdr/eWaAdRIPd_xS.IxZ)E,) ,g r o| u ~~~~~~~~~~~~~~~~~~p ( g| r stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)o up), 508 | | ^~~~~~~~~~~~~~~~~ war/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:I562n:B60l:o cnote: kfield 'group' will be initialized after field 'stepSize'( thread I562d | x . x / WtAiRdP(_tSiIdZ)E,) ,n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s warp(tid/WARP_SIZE( nthr e509a | d s ) , ftliadgITnhBrleoacdk(((tthirde%a4d)I=d=x3.)x,) ,g rgorupo(ugpr(ogurpo)u,p ) ,| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | ^~~~~~~~~~~| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmeIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->reIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkconnIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouvpr(egdroopu| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _ALGO _563# | # a l g os,t eNpCSCiLz_eP(RnOcTcOl_S#h#mpermo.tcoo>m(m)..bruufnf(S&inzcecsl[SNhCmCeLm_.PwRoOrTkO)_;S I\M P L| E ^] /NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:o15f:( Tnote: )field 'nthreads' will be initialized after field 'tidInBlock') { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562| | group(group tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ht:i33d:)7,: nnote: tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereh reads( n33t | h r e a d s )p,r itmisd(ItniBdl,o cnkt(htrheraedasd,I d&xr.ixn)g,- >gprroeuvp,( g&rroiunpg)-,> n e| x ^~~~~~~~~~~~~~~~~t , ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>60s:e nnote: dfield 'group' will be initialized after field 'stepSize'b uff, a562r | g s - > rteicdv(btuifdf),, anrtghsr-e>ardesd(OnptAhrrge,a d0s,) ,a rtgisd-I>ncBolnoncIkn(dtehxr,e aadrIgdsx-.>xc)o,n ngIrnoduepx()g;r o u| p ^) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56260::15 :note: field 'group' will be initialized after field 'stepSize'warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p S i| z tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e (nccl S563h | m e m . csotmemp.Sbiuzfef(SniczcelsS[hNmCeCmL._cPoRmOmT.Ob_uSfIfMSPiLzEe]s/[NNCCCCLL__SPTREOPTSO/_sSiIzMePoLfE(]T/)N)C C{L _ S| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E P S| / group(groups izeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 33 :| 7 group(group: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h : 33 : 7p:r inote: min instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested heres (tid, n t33h | r e a d s , p&rriimnsg(-t>ipdr,e vn,t h&rreiandgs-,> n&erxitn,g -a>rpgrse-v>,s e&nrdibnugf-f>,n eaxrtg,s -a>rrgesc-v>bsuefnfd,b uafrfg,s -a>rrgesd-O>prAercgv,b u0f,f ,a ragrsg-s>-c>ornendIOnpdAerxg,, a0r,g sa-r>gcso-n>ncIonndneIxn)d;e x ,| ^a rgs->c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ho:n78n:I5n:d enote: xin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here) ; | 78 ^ | runR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hi:n78g:<5T:, note: Rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heree dOp, 78P | r o t o >r(uanrRgisn)g;< T ,| ^R edOp, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:r202o:t53o:> (note: ain instantiation of member function 'RunWorkElement, 1, 2>::run' requested herer gs); 202| | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:W202o:r53k:E lnote: ein instantiation of member function 'RunWorkElement, 1, 2>::run' requested herem ent< F202n | , T , R e d ORp, Algo, PruontWoo>r(k)E.lreumne(nwte<)F;n , | T ^, RedOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp :A13l:g1o:, note: Pin instantiation of member function 'RunWork, 1, 2>::run' requested herer oto>( )13. | rIuMnP(Lw_eC)O;L L _| F ^U NC(Redu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cppc:e11S:c1a:t tnote: ein instantiation of member function 'RunWork, 1, 2>::run' requested herer , R I11N | GI,M PSLI_MCPOLLEL,_ FPUrNeCM(uRleSduumc,e Srccactlt_ebrf,l oRaItN1G6,) S I| M^P LE, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:M391u:l95S:u mnote: ,expanded from macro 'IMPL_COLL_FUNC' float) 391| | ^ Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hW:o391r:k95<:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Func# #391f | u n cR,u ntWyoprek,< nFcucnlcF#u#ndce#v#rfeudnocp,< ttyyppee>,, FNuCnCcL#_#AdLeGvOr_e#d#oaplC,C LN_CPCRLO_TAOL_G#O#_p#r#oatlog>o(,) .NrCuCnL(_&PnRcOcTlOS_h#m#epmr.owtoor>k());. r\u n (| & ^n cclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:w15o:r knote: )field 'nthreads' will be initialized after field 'tidInBlock'; \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t inote: dfield 'nthreads' will be initialized after field 'tidInBlock') , nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~, gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g60r:o unote: pfield 'group' will be initialized after field 'stepSize') , | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i60d:( tnote: ifield 'group' will be initialized after field 'stepSize'd ), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~, group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] In file included from 162/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp | : 1 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :d154e:f10a:u lwarning: tvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: | ^~~~~~~ 154 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165 : 33c:a snote: euninitialized use occurs here 3: | 165 ^ | copy/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cppT:o5S:h9m:e mnote: 8in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here( tid% W5A | R P _ S I Z E , MdSsCtC,L _sIrMcP,L _bKyEtReNsE)L;_ E N| T ^~~R Y_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 134| : ^14 : note: initialize the variable 'dst' to silence this warning /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 165134: | 33 : note: uninitialized use occurs herev oid *d s165t | , * s rcco;p y T| o ^S h m| e = nullptrm 8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 15 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:u154p:(10g:r owarning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ up), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work);In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ nt32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | steIn file included from p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cppS:i1z: e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:n154c:c10l:S hwarning: mvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]e m.comm.bu f154f | S i z e sc[aNsCeC L3_:P R O| T ^O _SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp/:s5i:z9e:of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *sIn file included from rc; /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp :| 1 ^: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : = nullptr154 :10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, fa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hl:s154e:)10;: warning: | variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :154399 | : 3 : note: cexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'a se 3: 399 | | ^ mscIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cppl:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cppR1:u: 5nIn file included from :I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h9n::t13 e: note: rIn file included from in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herep/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h r:e167t: e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h5r: | <562 t: y15 p: e ,warning: initializer order does not match the declaration order [-Wreorder-ctor] F uMnScC #C562#L | d_ eI vM rP eLtd_ioKdpE(E ,Nn TtPRhrYro_etFaoUdLNsLC(,_n DtfEhuVrlRelEaODdpOssP)>_,(T cYtoPimEdm(I,Mn aBaxll,og codk,o( utwbholrreek,a) d;fI ad\lx s. ex| )) ^;, g| r ^o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:p165(:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hg33:r:402o :unote: 3puninitialized use occurs here:) ,note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | 165 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | 402 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | c om ps563yc | Tc ol SR hu mnseItmne8tp(eStriipzdre%e(WtnAecRrcP_| ,P ^~~ R POrToOt_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hoS:LI162LM:1P52L:8E ,]warning: /variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]fN uCl Cl162LO | _p Ss T> E( PcdSoe/mfsmai,uz leatol:fg (o T,| ) ^~~~~~~)w o{/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr :k 165)| :; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~33 :\ | note: group(groupuninitialized use occurs here| ^ 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: :165 217: :33c57:o: p note: ynote: uninitialized use occurs hereTin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here o Shmem 8 165(217 | t | i d %P WrcAioRmpPiy_tTSioIvSZehEsm, ,b y1t,e sP)r;o t o| , ^~~ 0> pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:s162 : 5| : ^ warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp :1625 | : 9 : note: din instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested heree faul t5: | | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: :165M134:S:33C14:C: L note: _note: uninitialized use occurs hereIinitialize the variable 'dst' to silence this warning M PL _ 165K134 | E | R N E L _vEoNiTdR Y*_dFsUtN,C _*DsErVcR;E D O| P ^_ T Y| P = nullptrE (Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnthr:e154a:d10s:( nwarning: tvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]h reads), 154t | i d I n Bclaoscek (3t:h r e| a ^d Idx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cppg:r5o:u9p:) ,note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here | ^~~~~~~~~~~ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE In file included from 509/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp | : 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:l13a: gIn file included from T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hh:r168e: a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hd:(153(:t14i:d %warning: 4unused variable 'data1' [-Wunused-variable]) ==3), gr o153u | p ( g r ouuipn)t,3 2 _| t ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ d a| t warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3a 1, fl a510g | 1 , d asttae2p,S ifzlea(gn2c;c l S| h ^~~~~m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ho:m153m:.21b:u fwarning: funused variable 'flag1' [-Wunused-variable]S ize s153[ | N C C L _uPiRnOtT3O2__LtL 1d2a8t]a/1N,C CfLl_aSgT1E,P Sd/astiaz2e,o ff(luaign2t;6 4 _| t ^~~~~) ) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :| 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 28 :| group(groupwarning: unused variable 'data2' [-Wunused-variable] 153/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 217 : 57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h 386 | : 154 : 10 :i nwarning: tvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] wireOff s154e | t = WciarseeW o3r:d P e| r ^S lice*warp + /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp2:*5w:i9d:; note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here| ^ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ic<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h402: | 154 : 10m:s cwarning: cvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]l RunIn t154e | r p r e tcearsnote: ,in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here Prot o5L | L 1 2 8 , f u lMlSOCpCsL>_(IcMommP,L _aKlEgRoN,E Lw_oErNkT)R;Y _\F U N| C ^_ DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | In file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpps:c1c: lIn file included from R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:n13I: nIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.he:r169p: r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.he:t509e:r29<:t ywarning: pfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]e , Func## d507e | v r e d otpi),, PnrtohtroeSaidmsp(lnetE,) ,f u l| l ~~~~~~~~~~~~~~~~~~O p s| > stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)( comm, 508a | l g o , wwaorrpkI)n;B l\o c k| ( ^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:I165d:x33.:x /note: Wuninitialized use occurs hereA RP_S I165Z | E ) , c| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p y T| o warp(tid/WARP_SIZES hmem 8509( | t i d % WfAlRaPg_TShIrZeEa,d (d(stti,d %s4r)c=,= 3b)y,t egsr)o;u p (| g ^~~r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :| 162 ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~: 5 :| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 510 | 162 | s t e pdSeifzaeu(lntc:c l S| h ^~~~~~~m em./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:o165m:m33.:b unote: funinitialized use occurs heref Size s165[ | N C C L _cPoRpOyTTOo_SLhLm1e2m88](/tNiCdC%LW_ASRTPE_PSSI/ZsEi,z edosft(,u isnrtc6,4 _bty)t)e s{) ; | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :| 134 ^: 14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :initialize the variable 'dst' to silence this warning402:3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 134 | 402 | v o imds c*cdlsRtu,n I*nstrecr;p r e| t ^e r <| t = nullptry pe, Func##devredop, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpph:r1e: aIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)13,: In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hi:d169I: n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hB:l271o:c19k:( twarning: hunused variable 'ptr' [-Wunused-variable]r eadIdx. x271) | , g r o u p ( gurionutp6)4,_ t *| ^~~~~~~~~~~~~~~~~p tr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :=562 :r60e:c vnote: Pfield 'group' will be initialized after field 'stepSize't r(0)+ l562l | 1 2 8 O ftfisde(tt;i d )| , ^~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPIn file included from L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp_:K1E: RIn file included from N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:L13_: EIn file included from N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hT:R167Y: _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hF:U562N:C15_:D Ewarning: Vinitializer order does not match the declaration order [-Wreorder-ctor]R EDOP_TYPE (562M | i n , itnitd3(2t_itd,) ,f anltsher)e;a d s| ( ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:s402):,3 :t inote: dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'I nBloc k402( | t h rmesacdcIldRxu.nxI)n,t egrrporuept(egrrt,e pPSriozteo(LnLc1c2l8S,h mfeuml.lcOopmsm>.(bcuofmfmS,i zaelsg[oN,C CwLo_rPkR)O;T O\_ S I| M ^P LE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/:N165C:C33L:_ Snote: Tuninitialized use occurs hereE PS/s i165z | e o f ( Tc)o)p y{T o S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| 8 group(group( tid%WARP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:S217I:Z57E:, note: din instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested heres t, s r217c | , bPyrtiemsi)t;i v e| s ^~~< T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:d162O:p5,: Fwarning: avariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]n As y162m | m e t r idcet,: 1 ,| ^~~~~~~P ro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:o165,: 330:> note: puninitialized use occurs herer ims 165| | ^ c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cppo:p5y:T9o:S hnote: min instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested heree m8(t i5d | % W A R P _ S I ZMES,C CdLs_tI,M PsLr_cK,E RbNyEtLe_sE)N;T R Y| _ ^~~F UNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple , fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreaIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpps:)1,: In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:d13I: nIn file included from B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hl:o169c: k/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h(:t509h:r29e:a dwarning: Ifield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]d x.x), gr o507u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~, nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.he:s386[:N9C:C Lwarning: _variable 'wireOffset' set but not used [-Wunused-but-set-variable]P ROTO_ L386L | 1 2 8 ] /iNnCtC Lw_iSrTeEOPfSf/sseitz e=o fW(iurienWto6r4d_Pte)r)S l{i c e| * ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w a r| p group(group + 2*wid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h;: 217 :| 57 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, fIn file included from al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpps:e1): ;/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 154| : ^10 : warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: 402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 154 | 402 | c amssec c3l:R u n| I ^n terpreter, ProtoLL, false>' requested heree vred o5p | < t y p e > , PMrSoCtCoLL_LI1M2P8L,_ KfEuRlNlEOLp_sE>N(TcRoYm_mF,U NaCl_gDoE,V RwEoDrOkP)_;T Y\P E (| M ^i n, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:i165n:t333:2 _note: tuninitialized use occurs here, fa l165s | e ) ; c| o ^p yToS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hh:m399e:m38:( tnote: iexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd %WARP_ S399I | Z E ,m sdcsctl,R usnrIcn,t ebryptreest)e;r < t| y ^~~p e, F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:n162c:#5#:d ewarning: vvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]r ed o162p | < t y p ed>e,f aPurlott:o L L| , ^~~~~~~ f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:l165l:O33p:s >note: (uninitialized use occurs herec omm ,165 | a l g o ,c owpoyrTko)S;h m\e m 8| ( ^t id%WARP_SIZ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:,165 :d33s:t ,note: uninitialized use occurs heres rc, b y165t | e s ) ; c o| p ^~~y ToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyTo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hS:h134m:e14m:8 (note: tinitialize the variable 'dst' to silence this warningi d%W A134R | P _ S I ZvEo,i dd s*td,s ts,r c*,s rbcy;t e s| ) ^; | | = nullptr ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp :v1o: iIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :*13d: sIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h,: 169*: s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hr:c509;: 29 :| ^warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] | = nullptr 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cppn:I1n: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:r154p:r10e:t ewarning: rvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]< type, 154F | u n c # #cdaesver e3d:o p <| t ^y pe>, ProtoLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp1:258:,9 :f unote: lin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested herel Ops>( c5o | m m , a l g o ,M SwCoCrLk_)I;M P\L _ K| E ^R NEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h):=154=:310):, warning: gvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]r oup(gro u154p | ) , | c ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~a s e| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==33 : | ^ 510 | ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cppp:S5i:z9e:( nnote: cin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herec lShme m5. | c o m m . b u f fMSSiCzCeLs_[INMCPCLL__KPERRONTEOL__LELN1T2R8Y]_/FNUCNCCL__DSETVERPESD/OsPi_zTeYoPfE((uMiinnt,6 4u_itn)t)3 2{_ t ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f a l| s group(groupe ); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :217:57/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 405note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 217 | Pri m405i | t i vmessc#,d e1v,r ePdroopt> ,p rPirmost o S| i ^m ple, ProtoLL128, false>' requested hereN KSTE P5S | / M S C C L _ S LMISCCECSLT_EIPMSP,L _MKSECRCNLE_LS_LEINCTERSYT_EFPUSN>C,_ DfEuVlRlEODpOsP>_(TcYoPmEm(,M ianl,g ou,i nwto3r2k_)t;, \f a l| s ^e ); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ^165 :33: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :uninitialized use occurs here402 :3: note: 165expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | c402o | p y TmosSchcmleRmu8n(Itnitde%rWpArRePt_eSrI, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:t162o:L5L:1 2warning: 8variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized], f u162l | l O p s >d(ecfoamuml,t :a l g| o ^~~~~~~, w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:r165k:)33;: \ note: uninitialized use occurs here| ^ 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h6:4514_:t9*: pwarning: tvariable 'offset' set but not used [-Wunused-but-set-variable]r = re c514v | P t r ( 0i)n+tl lo1f2f8sOeftf s=e tt;i d ;| ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 154w:o10r:k )warning: ;variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] \ | ^ 154 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15c:a snote: efield 'nthreads' will be initialized after field 'tidInBlock' 3: | ^ 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cppt:i5d:)9,: nnote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereh read s5( | n t h r e a d s )M,S CtCiLd_IInMBPlLo_cKkE(RtNhErLe_aEdNITdRxY._xF)U,N Cg_rDoEuVpR(EgDrOoPu_pT)Y,P E (| M ^~~~~~~~~~~~~~~~~i n,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n60t:8 _note: tfield 'group' will be initialized after field 'stepSize', fal s562e | ) ; | t ^i d(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 402n:t3h:r enote: aexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd s(nthr e402a | d s )m,s ctcildRIunnBIlnotcekr(ptrherteeard, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :uninitialized use occurs here134 :14: 165note: | initialize the variable 'dst' to silence this warning cop y134T | o S h m evmo8i(dt i*dd%sWtA,R P*_sSrIcZ;E , | d ^s t ,| = nullptrs rc, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInIn file included from B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cppl:o1c: kIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:h13r: eIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:I169d: x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h.:x509/:W29A:R Pwarning: _field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]S IZE), | 507 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | warp(tid/WARP_SIZE tid(t i509d | ) , n tfhlraegaTdhsr(enatdh(r(etaidds%)4,) =w=i3d)(,t igdr%oWuApR(Pg_rSoIuZpE)),, w| a ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~r p (| t warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3i d/WAR P510_ | S I Z E )s,t e p| S ~~~~~~~~~~~~~~~~~~i z e| ( stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)n ccl S508h | m e m . cwoamrmp.IbnuBflfoScikz(etsh[rNeCaCdLI_dPxR.OxT/OW_ALRLP1_2S8I]Z/EN)C,C L _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T E P| S warp(tid/WARP_SIZE/ size o509f | ( u i n tf6l4a_gtT)h)r e{a d (| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| % group(group4 )==3), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:u217p:(57g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herep ), | 217 ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | | P warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3r imiti v510e | s < T , sRteedpOSpi,z eF(anncAcslySmhmmeetmr.iccou,f f1S,i zPerso[tNoC,C L0_>P RpOrTiOm_sL L 1| 2 ^8 ]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp_:S5T:E9P:S /note: sin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herei zeof (5u | i n t 6 4 _ t ) )M S{C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I M P| L group(group_ KERNEL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:E217N:T57R:Y _note: Fin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereU NC_ D217E | V R EPDrOiPm_iTtYiPvEe(sM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 4021:,3 :P rnote: oexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE't o, 0> 402p | r i mmss c c| l ^R unI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cppn:t5e:r9p:r enote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested heree rN,E LP_rEoNtToRLYL_1F2U8N,C _fDuElVlROEpDsO>P(_cToYmPmE,( Mailng,o ,u iwnotr8k_)t;, \f a l| s ^e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* pIn file included from tr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp :=1 : rIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:v13P: tIn file included from r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h(:0168): +/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hl:l1531:2148:O fwarning: funused variable 'data1' [-Wunused-variable]s et; | ^~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ VREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:N154T:R10Y:_ Fwarning: Uvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]N C_DEVRED O154P | _ T Y P Ec(aMsien ,3 :d o u| b ^l e, false);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp : 5| : ^9 : note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here: 399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'5 | 399 | mMsScCcClLR_uInMIPnLt_eKrEpRrNeEtLe_rEn,, PdrooutbolLeL,, ffaullsleO)p;s > (| c ^o mm, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hl:g402o:,3 :w onote: rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'k ); \ 402| | ^ msc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:l165R:u33n:I nnote: tuninitialized use occurs heree rp 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr reter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:I134n:t14e:r pnote: rinitialize the variable 'dst' to silence this warninge ter< t134y | p e , Fvuonicd# #*ddesvtr,e d*ospr ^, P| r = nullptro toSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouIn file included from p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp(:g1r: oIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:)13,: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h| : ^~~~~~~~~~~~~~~~~167 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::60 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :u134i:n14t:3 2note: _initialize the variable 'dst' to silence this warningt data 1134, | f l a gv1o,i dd a*tdas2t,, f*lsargc2;; | | ^ ^~~~~ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10:In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cppwarning: :variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :154169 | : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h : 509 :c29a:s ewarning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]3 : | ^ 507 | tid(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cppn:t5h:r9e:a dnote: sin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here( nthrea d5s | ) , w i d ( t iMdS%CWCALR_PI_MSPILZ_EK)E,R NwEaLr_pE(NtTiRdY/_WFAURNPC__SDIEZVER)E,D O P| _ ~~~~~~~~~~~~~~~~~~T Y P| E stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)( Min ,508 | u i n t 6w4a_rtp,I nfBalloscek)(;t h r| e ^a dIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hx:/399W:A3R:P _note: Sexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'I ZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~399 | | warp(tid/WARP_SIZEm scc l509R | u n I n tfelrapgrTehtreera),, P r| o ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~t o L| L warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3, ful l510O | p s > ( csotmemp,S iazleg(on,c cwloSrhkm)e;m .\c o m| m ^. buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hz:e165s:[33N:C Cnote: Luninitialized use occurs here_ PROTO_ L165L | 1 2 8 ] /cNoCpCyLT_oSSThEmPeSm/8s(itziedo%fW(AuRiPn_tS6I4Z_Et,) )d s{t , | s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r c ,| group(groupb ytes); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :| 217 ^~~: 57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :217162 | : 5 :P rwarning: ivariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]m iti v162e | s < T , dReefdaOupl,t :F a n| A ^~~~~~~s ymm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:t165r:i33c:<1,1> ,note: uninitialized use occurs here1 , Prot o165, | 0 > pcroipmysT o S| h ^m /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5e:m98:( tnote: iin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hered %WA R5P | _ S I Z E , d sMtS,C CsLr_cI,M PbLy_tKeEsR)N;E L _| E ^~~N TRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | defaulIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, falset: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ , dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hllOps:>154(:c10o:m mwarning: ,variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] algo, w o154r | k ) ; \c a s| e ^ 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ d%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlockIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cppt:h1r: e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:d154I:d10x:. xwarning: )variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized], gro u154p | ( g r o ucpa)s,e 3| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp :s5t:e9p:S inote: zin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested heree (ncc l5S | h m e m . c o m mM.SbCuCfLf_SIiMzPeLs_[KNECRCNLE_LP_REONTTOR_YS_IFMUPNLCE_]D/ENVCRCELD_OSPT_ETPYSP/Es(iMzieno,f (fTl)o)a t{, f| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l s e| ) group(group; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h217::39957::3 :note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested herenote: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 217 | 399P | r i mmistcicvleRsur,e d1o,p ,, 0P>r optroiLmLs, f| u ^l lOps>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp(:c5o:m9m:, note: ain instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herel go, w5o | r k ) ; \ | M ^S CCL_IMPL_K/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:R165N:E33L:_ Enote: Nuninitialized use occurs hereT RY_FUNC_ D165E | V R E D OcPo_pTyYTPoES(hMmienm,8 (ftlioda%tW,A RfPa_lSsIeZ)E;, d| s ^t , s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:c405,: 3b:y tnote: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | iIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nt wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp::1541:: 10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 154warning: :variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 154 | c a s e c3a:s e | 3 ^: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp :55 | : 9 : note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here MS C5C | L _ I M P L _ K EMRSNCECLL__EINMTPRLY__KFEURNNCE_LD_EEVNRTERDYO_PF_UTNYCP_ED(EMViRnE,D OhPa_lTfY,P Ef(aMlisne,) ;h a l| f ^, fals/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:)402;: 3 :| ^note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399: 3402: | note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'm scclR u399n | I n tmesrcpcrleRtuenrIr,e dPorpo2,8 ,P rfoutlolLOLp,s >f(uclolmOmp,s >a(lcgoom,m ,w oarlkg)o;, \w o r| k ^) ; \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165| : ^33 : note: uninitialized use occurs here 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165 : 33c:o pnote: yuninitialized use occurs hereT oShmem8 (165t | i d % W AcRoPp_ySTIoZSEh,m edms8t(,t isdr%cW,A RbPy_tSeIsZ)E;, d| s ^~~t , sr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:,162 :b5y:t ewarning: svariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]) ; 162| | ^~~ defaul/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht::162 : 5| : ^~~~~~~ warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]: 165:33: 162note: | uninitialized use occurs here d165e | f a u l tc:o p y| T ^~~~~~~o Shm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:m1658:(33t:i dnote: %uninitialized use occurs hereW ARP_S I165Z | E , d scto,p ysTrocS,h mbeymt8e(st)i;d % W| A ^~~R P_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ Y_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr YPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ erpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVRreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr EDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 154c:o10p:y Twarning: ovariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]S hmem8 (154t | i d % W AcRaPs_eS I3Z:E , | d ^s t, src,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp :b5y:t9e:s )note: ;in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here | ^~~ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:o134t:o14S:i mnote: pinitialize the variable 'dst' to silence this warningl e, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp :d1e: fIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:l13t: :In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :| 169 ^~~~~~~: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h509::16529::33 :warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]note: uninitialized use occurs here 165 | 507 | c o ptyiTdo(Sthimde)m,8 (nttihdr%eWaAdRsP(_nStIhZrEe,a ddss)t,, wsirdc(,t ibdy%tWeAsR)P;_ S I| Z ^~~E ), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:m134.:c14o:m mnote: .initialize the variable 'dst' to silence this warningb uffS i134z | e s [ N CvCoLi_dP R*OdTsOt_,L L*1s2r8c];/ N C| C ^L _ S| T = nullptrE PS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp : 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 13 : MIn file included from S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hC:C167L: _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L15_:K Ewarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]N EL_ENTRY_F U562N | C _ D E VtRiEdDOP(_tTiYdP)E,( Mnitnh,r eracdcsl(_nbtfhlroeaatd1s6),, ftaildsIen)B;l o c| k ^( thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:a399d:I3d:x .note: xexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE') , gr o399u | p ( gmrsocucpl)R,u n I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t e r| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eter< t563y | p e , Fsutnecp#S#idzeev(rnecdcolpSc,o mPmr.obtuofLfLS,i zfeusl[lNOCpCsL>_(PcRoOmTmO,_ SaIlMgPoL,E ]w/oNrCkC)L;_ S\T E P| S ^/ siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:o165f:(33T:) )note: uninitialized use occurs here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 | | group(group copyTo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hS:h217m:e57m:8 (note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested herei d%WA R217P | _ S IPZrEi,m idtsitv,e sswarning: ,variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 1, P162r | o t o , d0e>f apurlitm:s | | ^~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp::335:: 9note: :uninitialized use occurs here note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 165 | 5 | c o p y T o SMhSmCeCmL8_(ItMiPdL%_WKAERRPN_ESLI_ZEEN,T RdYs_tF,U NsCr_cD,E VbRyEtDeOsP)_;T Y P| E ^~~( Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/clang++ -fPIC -pipe -frecord-gcc-switches -Wall -g -O2 -parallel-jobs=16 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.1.40093 --hip-link --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 /usr/lib/llvm-rocm/lib64/clang/17/lib/linux/libclang_rt.builtins-x86_64.a -lpthread -lrt -ldl Elapsed time (seconds): 494.099 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Built target rccl gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.31154 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/rccl-buildroot + : + /bin/rm -rf -- /usr/src/tmp/rccl-buildroot + PATH=/usr/libexec/rpm-build:/usr/src/bin:/usr/bin:/bin:/usr/local/bin:/usr/games + cd rccl-2.18.6 + DESTDIR=/usr/src/tmp/rccl-buildroot + cmake --install x86_64-alt-linux --verbose -- Install configuration: "" -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1.0 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-allpairs-16n-16tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-16tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-1pass.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets-noconfig.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl/LICENSE.txt + rm -rf /usr/src/tmp/rccl-buildroot/usr/rccl + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/rccl-buildroot (auto) mode of './usr/lib64/librccl.so.1.0' changed from 0755 (rwxr-xr-x) to 0644 (rw-r--r--) Verifying and fixing files in /usr/src/tmp/rccl-buildroot (binconfig,pkgconfig,libtool,desktop,gnuconfig) Checking contents of files in /usr/src/tmp/rccl-buildroot/ (default) Compressing files in /usr/src/tmp/rccl-buildroot (auto) Adjusting library links in /usr/src/tmp/rccl-buildroot ./usr/lib64: (from :0) librccl.so.1 -> librccl.so.1.0 Verifying ELF objects in /usr/src/tmp/rccl-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) section [ 3] '.dynsym': symbol 338 (__hip_fatbin): symbol in dynamic symbol table with non-default visibility verify-elf: WARNING: ./usr/lib64/librccl.so.1.0: eu-elflint failed Splitting links to aliased files under /{,s}bin in /usr/src/tmp/rccl-buildroot Processing files: librccl1-2.18.6-alt0.1 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.81699 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + DOCDIR=/usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + export DOCDIR + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + /bin/mkdir -p /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + cp -prL README.md LICENSE.txt NOTICES.txt CHANGELOG.md /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R go-w /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R a+rX /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.PXFfYO find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) lib.prov: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1: 192 symbols, 18 bpp Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.1yHHXg find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) warning: librccl1 provides another subpackage: rccl Provides: rccl = 2.18.6-alt0.1, librccl.so.1()(64bit) = set:ldySY8WxOALBnhFpKYr8hTuOp4f4mGu2jLdMJjcZCXM47UXuwyyGRGWXKgETcgdjMi5wuDQ3qOxtZBm81J7pYPMIUZa5VdctQkKefUrjndPqhuFfak8KACxDBZ2WZJDfvJzZ89VmVuIkNiinUuRvWX09AlpiViW0mDiqb8i3YJossrximfgU5FDIg3bfAM3p87RAKcG4MZinBzsSGNgsBCROo9k0v79172vNT21EO938Mcw8TzCb018bhHvvzgmTvhhNQWFQoI4SSRedfYZyMcS4HABqmacW4xzCUZaO5x9LSUxVFl0qy5C7FFGgAn04Hyxww4hPwz6LsL4UDEnEe2dpGZx29zB56rIHYGcZG1BqjQafIX1WE3sbDhXCpfBjMq4 Requires: ld-linux-x86-64.so.2()(64bit) >= set:jiids, ld-linux-x86-64.so.2(GLIBC_2.3)(64bit), libamdhip64.so.6()(64bit) >= set:mgEl4iHah5shPP2z5A5zYttYI7XpZyRnhe1J6ZgwULwPlWeYZ4XbZd2bItRMqeW4hZmmUYmDZdpDnrYqkUKOuzfUwKzIyQItN97gggSsa6v6KYBa3m70aJ49gh1ckMQcuEPMZKgWZw, libamdhip64.so.6(hip_4.2)(64bit), libamdhip64.so.6(hip_4.3)(64bit), libamdhip64.so.6(hip_4.5)(64bit), libamdhip64.so.6(hip_5.0)(64bit), libamdhip64.so.6(hip_5.3)(64bit), libamdhip64.so.6(hip_6.0)(64bit), libc.so.6(GLIBC_2.14)(64bit), libc.so.6(GLIBC_2.17)(64bit), libc.so.6(GLIBC_2.2.5)(64bit), libc.so.6(GLIBC_2.3)(64bit), libc.so.6(GLIBC_2.3.2)(64bit), libc.so.6(GLIBC_2.3.4)(64bit), libc.so.6(GLIBC_2.33)(64bit), libc.so.6(GLIBC_2.34)(64bit), libc.so.6(GLIBC_2.38)(64bit), libc.so.6(GLIBC_2.6)(64bit), libgcc_s.so.1(GCC_3.0)(64bit), libm.so.6(GLIBC_2.2.5)(64bit), librocm_smi64.so.1()(64bit) >= set:miSwa9ZECgdMsH9hGiyEU5mNQ1, libstdc++.so.6(CXXABI_1.3)(64bit), libstdc++.so.6(CXXABI_1.3.5)(64bit), libstdc++.so.6(CXXABI_1.3.7)(64bit), libstdc++.so.6(GLIBCXX_3.4)(64bit), libstdc++.so.6(GLIBCXX_3.4.11)(64bit), libstdc++.so.6(GLIBCXX_3.4.18)(64bit), libstdc++.so.6(GLIBCXX_3.4.19)(64bit), libstdc++.so.6(GLIBCXX_3.4.21)(64bit), libstdc++.so.6(GLIBCXX_3.4.22)(64bit), libstdc++.so.6(GLIBCXX_3.4.29)(64bit), rtld(GNU_HASH) Requires(rpmlib): rpmlib(SetVersions) Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.ADM5p1 Creating librccl1-debuginfo package Processing files: librccl-devel-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.yJ8ckA find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.FXas8V find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:12: /usr/include/hip/hip_runtime.h:66:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 66 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:70: /usr/include/hip/hip_runtime_api.h:8852:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 8852 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:71: /usr/include/hip/library_types.h:75:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 75 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:73: /usr/include/hip/hip_vector_types.h:38:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 38 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:13: /usr/include/hip/hip_fp16.h:33:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 33 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ cpp.req: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed, trying c++ mode x86_64-alt-linux-cpp: fatal error: cannot execute 'cc1plus': execvp: No such file or directory compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h:10:10: fatal error: nccl.h: No such file or directory 10 | #include "nccl.h" | ^~~~~~~~ compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h: cpp failed Provides: rccl-devel = 2.18.6-alt0.1 Requires: /usr/lib64/librccl.so.1 Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.Soq24O Processing files: librccl1-debuginfo-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.JvrVuk find-provides: running scripts (debuginfo) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.Cf6oac find-requires: running scripts (debuginfo) Provides: debug64(librccl.so.1) Requires: librccl1 = 2.18.6-alt0.1, debug64(ld-linux-x86-64.so.2), debug64(libamdhip64.so.6), debug64(libc.so.6), debug64(libgcc_s.so.1), debug64(libm.so.6), debug64(librocm_smi64.so.1), debug64(libstdc++.so.6) Adding to librccl1-debuginfo a strict dependency on librccl1 Adding to librccl-devel a strict dependency on librccl1 Removing 1 extra deps from librccl-devel due to dependency on librccl1 Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-2.18.6-alt0.1.x86_64.rpm (w2T16.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl-devel-2.18.6-alt0.1.x86_64.rpm (w2T16.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm (w2.lzdio) 21073.42user 966.31system 26:00.27elapsed 1412%CPU (0avgtext+0avgdata 5528492maxresident)k 2784inputs+0outputs (65major+87103677minor)pagefaults 0swaps /.out/librccl1-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl-devel-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // 7.96user 5.27system 27:42.39elapsed 0%CPU (0avgtext+0avgdata 136472maxresident)k 1769184inputs+0outputs (0major+336629minor)pagefaults 0swaps